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PLANT STEROL REDUCTASES A ND USES THEREOF 

5 

Bfrpkgroynd pf the Inventipn 

This application relates to plant sterol biosynthetic enzymes, genes, and their 

uses. 

Plant sterols belong to a large group of secondary compounds known as 

10 terpenes or isoprenoids. Sterol biosynthesis in plants generally involves a series of 
different enzymatic steps in the isoprenoid pathway that result in the formation of a 
variety of sterol end products (Benveniste Ann. Rev. Biochem. 37:275, 1986). 
Although such sterol compounds have been identified in higher plants, their function 
in plant growth and development is poorly understood. 

15 One such plant sterol, brassinolide, that belongs to a class of sterols referred to 

as brassinosterioids (BR), was first discovered in the pollen of Brassica napus (Grove 
et. al., Nature 281 : 216, 1979). Brassinosteroids are growth-promoting natural 
products having structural similarities to animal steroid hormones. The wide 
distribution of brassinosteroids in the plant kingdom, their effect on cell proliferation 

20 and elongation, and their interactions with other plant hormones (e.g., cytokinins), 

have indicated that these compounds are plant-growth regulators. Brassinosteroids are 
thought to promote hypocotyl elongation, leaf unrolling, and xylem differentiation. In 
addition, such compounds are also believed to be involved in de-etiolation of 
cotyledons, root elongation, radial growth, and anthocyanin formation. 

25 The function of plant sterol growth regulators, such as BR, in relationship to 

other classes of plant growth regulators such as auxin, gibberellin, abscisic acid, and 
cytokinin, during plant development also needs to be evaluated. For example, the 
growth regulator, cytokinin, is known to affect a variety of developmental processes 
including photomorphogenesis, chloroplast biogenesis and maintenance, apical 
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dominance, and senescence. In addition, this growth regulator is thought to 
antagonize BR's ability to promote hypocotyl elongation and cotyledon de-etiolation. 



Summary of the Invention 
5 In general, the invention features a substantially pure plant C-14 sterol 

reductase polypeptide. Preferably, the C-14 sterol reductase polypeptide includes an 
amino acid sequence substantially identical to the sequence shown in Fig. 14 (SEQ ID 
NO: 1); and is from a dicot (for example, a crucifer or a solanaceous plant), monocot, 
gymnosperm, or an alga. 

10 In related aspects, the invention features purified DNA that includes a sequence 

encoding a C-14 sterol reductase polypeptide (for example, a sequence substantially 
identical to the DNA sequence shown in Fig. 14; SEQ ID NO: 2; or a DNA sequence 
that encodes a C-14 sterol reductase polypeptide which has an amino acid sequence 
substantially identical to that shown in Fig. 14; SEQ ID NO: 1). The invention also 

15 features a vector and a cell, each of which includes purified DNA encoding a C-14 
sterol reductase polypeptide; and a method of producing a recombinant C-14 sterol 
reductase polypeptide involving providing a cell (for example, a plant cell) 
transformed with purified DNA encoding a C-14 sterol reductase polypeptide 
positioned for expression in the cell, culturing the transformed cell under conditions 

20 for expressing the DNA, and isolating the recombinant C-14 sterol reductase 
polypeptide. The invention further features recombinant C-14 sterol reductase 
produced by such expression of a purified DNA, and an isolated antibody that 
specifically recognizes and binds a plant C-14 sterol reductase polypeptide. 

In addition, the invention features nucleotide sequences that hybridize to a C-14 

25 sterol reductase gene (including the coding sequence of such a gene and its 

complement) and that encode a C-14 sterol reductase polypeptide. Furthermore, the 
invention includes oligonucleotide probes that detect a C-14 sterol reductase gene or 
functional equivalents thereof in a plant (for example, dicots (such as solanaceous and 
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cruciferous plants), monocots, gymnosperms, and algae). Such probes are useful to 
isolate DNA sequences that encode C- 14 sterol reductases from other plants. In one 
particular example, oligonucleotides may be designed based on a C-14 sterol reductase 
sequence disclosed herein and used as hybridization probes or as primers in 
5 polymerase chain reactions (PCR). Conserved regions in the C-14 sterol reductase 
gene are useful in the design of such primers to facilitate the recovery of C-14 sterol 
reductases from other related and unrelated plants. 

In yet other related aspects, the invention features a transgenic plant (or seeds 
or cells thereof) containing DNA encoding a C-14 sterol reductase polypeptide 

10 integrated into the genome of the plant, where the DNA is expressed in the transgenic 
plant, resulting in the production of a C-14 sterol reductase polypeptide. 

In still another aspect, the invention features a method for reducing the level of 
a plant C-14 sterol reductase polypeptide in a transgenic plant cell. This method 
generally involves expressing in the transgenic plant cell an antisense C-14 sterol 

15 reductase polypeptide nucleic acid sequence. In general, such an antisense C-14 sterol 
reductase nucleic acid sequence is encoded by a transgene integrated into the genome 
of the transgenic plant cell and is based on the nucleotide sequence that is shown in 
Fig. 14 (SEQ ID NO: 2) or Fig. 15. (SEQ ID NO: 3). In preferred embodiments, the 
plant cell expressing an antisense C-14 sterol reductase nucleic acid sequence is a 

20 dicot (for example, crucifer), monocot, gymnosperm, or algal cell. In yet other 
preferred embodiments, the method involves growing a transgenic plant from the 
transgenic plant cell, whereby the level of the C-14 sterol reductase polypeptide is 
reduced in the transgenic plant. 

In other related aspects, the invention features a plant cell expressing an 

25 antisense C-14 sterol reductase nucleic acid sequence and a plant expression vector 
that includes an antisense C-14 sterol reductase nucleic acid sequence, where the 
antisense sequence is operably linked to an expression control region. 
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In another aspect, the invention features a method for increasing the level of a 
C-14 sterol reductase in a transgenic plant cell. This method involves expressing in 
the transgenic plant cell a C-14 sterol reductase polypeptide nucleic acid sequence. 
Preferably, the method utilizes a C-14 sterol reductase nucleic acid sequence that is 
substantially identical to the nucleotide sequence that is shown Fig. 14 (SEQ ID NO: 
2). In preferred embodiments, the plant cell expressing a C-14 sterol reductase 
polypeptide nucleic acid sequence is a dicot (for example, a crucifer), monocot, 
gymnosperm, or algal cell. 

In another aspect, the invention features a transgenic plant having a knockout 
mutation in DNA encoding a plant C-14 sterol reductase polypeptide. Such knockout 
genes are constructed according to conventional methods 
(e.g., Lee et al. Plant Cell 2: 415, 1990; Miao and Lam, Plant J. 7: 359, 1995). 

By "plant C- 1 4 sterol reductase" is meant an amino acid sequence that 
catalyzes the reduction of any sterol precursor having a C14=C15 double bond, for 
example, as described by Benveniste, Annu. Rev. Biochem. 37: 275, 1986. Preferably, 
such a polypeptide has an amino acid sequence which is at least 30%, preferably 40%, 
and most preferably 50% or even 80-95% identical to the amino acid sequence of the 
C-14 sterol reductase polypeptide shown in Fig. 14 (SEQ ID NO: 1). The length of 
comparison of amino acid sequences will generally be at least 16 amino acids, 
preferably at least 20 amino acids, more preferably at least 25 amino acids, and most 
preferably at least 35 amino acids. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of 
length or post-translational modification (for example, glycosylation or 
phosphory 1 ation) . 

By a "substantially identical" polypeptide sequence is meant an amino acid 
sequence that differs only by conservative amino acid substitutions, for example, 
substitution of one amino acid for another of the same class (e.g., valine for glycine, 
arginine for lysine, etc.) or by one or more non-conservative substitutions, deletions, 
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or insertions, located at positions of the amino acid sequence that do not destroy the 
function of the polypeptide (assayed, for example, as described herein). 

Sequence identity is typically measured using sequence analysis software (for 
example, Sequence Analysis Software Package of the Genetics Computer Group 
5 (University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705), BLAST, or PILEUP/PRETTYBOX programs). Such software matches 
sequences by assigning degrees of homology to various substitutions, deletions, and 
other modifications. 

By "substantially pure polypeptide" is meant a polypeptide preparation that is at 

10 least 60% by weight (dry weight) the compound of interest, for example, the C-14 
sterol reductase polypeptide or C-14 sterol reductase-specific antibody. Preferably, 
the preparation is at least 75%, more preferably at least 90%, and most preferably at 
least 99%, by weight the compound of interest. Purity can be measured by any 
appropriate method, for example, column chromatography, polyacrylamide gel 

15 electrophoresis, or HPLC analysis. 

By "purified DNA" is meant DNA that is not immediately contiguous with both 
of the coding sequences with which it is immediately contiguous (one on the 5' end 
and one on the 3 ! end) in the naturally-occurring genome of the organism from which 
it is derived. The term therefore includes, for example, a recombinant DNA that is 

20 incorporated into a vector; into an autonomously replicating plasmid or virus; or into 
the genomic DNA of a prokaryote or eukaryote, or that exists as a separate molecule 
(for example, a cDNA or a genomic DNA fragment produced by PCR or restriction 
endonuclease treatment) independent of other sequences. It also includes a 
recombinant DNA that is part of a hybrid gene encoding one or more additional amino 

25 acids. 

By a "substantially identical" nucleic acid is meant a nucleic acid sequence that 
encodes a polypeptide differing only by conservative amino acid substitutions, for 
example, substitution of one amino acid for another of the same class (e.g., valine for 
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glycine, arginine for lysine, etc.) or by one or more non-conservative substitutions, 
deletions, or insertions, located at positions of the amino acid sequence that do not 
destroy the function of the polypeptide (assayed, for example, as described herein). 
Again, the encoded sequence is at least 30%, more preferably 40%, and most 
preferably 50%, or even 80 to 95% identical at the amino acid level to the sequence of 
Fig. 14 (SEQ ID NO: 1). Thus, when nucleic acid sequences are compared, a 
"substantially identical" nucleic acid sequence is one which is at least 30%, more 
preferably 40%, and most preferably 50%, or even 80 to 95% identical to the sequence 
of Fig. 14 (SEQ ID NO: 2). The length of nucleic acid sequence comparison will 
generally be at least 30 nucleotides, preferably at least 60 nucleotides, more preferably 
at least 75 nucleotides, and most preferably 110 nucleotides. Again, identity is 
typically measured using sequence analysis software (for example, Sequence Analysis 
Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). 

By "isolated antibody" is meant antibody that is at least 60%, by weight, free 
from the proteins and naturally-occurring organic molecules with which it is naturally 
associated. Preferably, the preparation is at least 75%, more preferably at least 90%, 
and most preferably at least 99%, by weight, antibody. 

By "specifically binds" is meant an antibody that recognizes and binds a C-14 
sterol reductase polypeptide but which does not substantially recognize and bind other 
molecules in a sample (e.g., a biological sample) which naturally includes a C-14 
sterol reductase. An antibody which "specifically binds" a C-14 sterol reductase is 
sufficient to detect a C-14 sterol reductase product in such a biological sample using 
one or more of the standard immunological techniques available to those in the art (for 
example, Western blotting or immunoprecipitation). 

By "an antisense C-14 sterol reductase sequence" is meant a nucleotide 
sequence that is complementary to a plant C-14 sterol reductase messenger RNA. In 
general, such an antisense sequence will usually be at least 15 nucleotides, preferably 
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about 15-200 nucleotides, and more preferably 200-2,000 nucleotides in length. The 
antisense sequence may be complementary to all or a portion of the plant C-14 sterol 
reductase mRN A nucleotide sequence, and, as appreciated by those skilled in the art, 
the particular site or sites to which the antisense sequence binds as well as the length 
5 of the antisense sequence will vary, depending upon the degree of inhibition desired 
and the uniqueness of the antisense sequence. By binding to the appropriate target 
sequence, an RNA-RNA, DNA-DNA, or RNA-DNA duplex is formed. A 
transcriptional construct expressing a plant C-14 sterol reductase antisense nucleotide 
sequence includes, in the direction of transcription, a promoter, the sequence coding 

10 for the antisense RNA on the sense strand, and a transcriptional termination region. 
Antisense C-14 sterol reductase sequences may be constructed and expressed as 
described herein or as described, for example, in van der Krol et al., Gene 72: 45, 
1988; Rodermel et al., Cell 55: 673, 1988; Mol et al., FEBS Lett. 268: 427, 1990; 
Weigel and Nilsson, Nature 377: 495, 1995; Cheung et al., Cell 82, 383, 1995; and 

15 U.S. Pat. No. 5,107,065. In addition, C-14 sterol reductase antisense sequences are 
useful for the formation of triple helices, where the antisense sequence is bound to a 
DNA duplex. By binding to the target nucleic acid, C-14 sterol reductase antisense 
sequences can inhibit the function of the target nucleic acid. This results, for example, 
in the blocking of transcription, processing of poly A+ addition, replication, 

20 translation, or promoting inhibitory mechanisms of the cell, such as RNA degradation. 
The triple helix-forming and antisense C-14 sterol reductase sequences are useful for 
selectively suppressing certain cellular functions that are associated with C-14 sterol 
reductase activity. 

By a "transformed cell 11 is meant a cell into which (or into an ancestor of 
- 25 which) has been introduced, by means of recombinant DNA techniques, a DNA 

molecule encoding (as used herein) a C-14 sterol reductase polypeptide (for example, 
a substantially identical DNA encoding the C-14 sterol reductase shown in Fig. 14 
(SEQ ID NO: 2)). 
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By "positioned for expression" is meant that the DNA molecule is positioned 
adjacent to a DNA sequence which directs transcription and translation of the 
sequence (for example, facilitates the production of, for example, a plant C-14 sterol 
reductase polypeptide such as the amino acid sequence shown in Fig. 14 (SEQ ID NO: 
1)), or an RNA molecule (for example, an antisense RNA). 

By "promoter" is meant a minimal sequence sufficient to direct transcription. 
Included in the invention are promoter elements that are sufficient to render promoter- 
dependent gene expression controllable for cell-, tissue-, or organ-specific gene 
expression, or elements that are inducible by external signals or agents (for example, 
light-, pathogen-, wound-, stress-, or hormone-inducible elements); such elements may 
be located in the 5' or 3* regions of the native gene or engineered into a transgene 
construct. 

By "operably linked" is meant that a gene and a regulatory sequence(s) are 
connected in such a way as to permit gene expression when the appropriate molecules 
(for example, transcriptional activator proteins) are bound to the regulatory 
sequence(s). 

By "crucifer" is meant any plant that is classified within the Cruciferae family 
as commonly described in, e.g., Gray's Manual of Botany American Book Company, 
N.Y., 1950; Hortus Third: A Concise Dictionary of Plants Cultivated in the U.S. and 
Canada, Macmillan, 1976; or Simmons, N.W., Evolution of Crop Plants, 1986. The 
Cruciferae include many agricultural crops, including, but not limited to, broccoli, 
cabbage, brussel sprouts, rapeseed, kale, Chinese kale, cauliflower, horseradish, and 
Arabidopsis. 

By "plant cell" is meant any self-propagating cell bounded by a semi-permeable 
membrane and containing a plastid. Such a cell also requires a cell wall if further 
propagation is desired. Plant cell, as used herein includes, without limitation, algae, 
cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, callus 
tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 
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By "transgene" is meant any piece of DNA that is inserted by artifice into a 
cell, and becomes part of the genome of the organism which develops from that cell. 
Such a transgene may include a gene which is partly or entirely heterologous (i.e., 
foreign) to the transgenic organism, or may represent a gene homologous to an 
5 endogenous gene of the organism. 

By "transgenic" is meant any cell that includes a DNA sequence which is 
inserted by artifice into a cell and becomes part of the genome of the organism which 
develops from that cell. As used herein, the transgenic organisms are generally 
transgenic plants and the DNA (transgene) is inserted by artifice into the nuclear or 
10 plastidic genomes. 

Other features and advantages of the invention will be apparent from the 
following detailed description thereof, and from the claims. 

15 Deta i led De scription 

The drawings will first be described. 
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Figs. 1 A-1B are photographs showing that the ell mutant morphology was 
phenocopied by treating wild-type seedlings with 30 /^M of dimethylallylamino-purine 
(2ip), a synthetic cytokinin, in the dark (Fig. 1 A) or light (Fig. IB). From left to right 
5 in both Figs. 1 A-1B: wild-type plant; wild-type plant + 2ip; ell; and ell + 2ip. 

Fig. 2 is a photograph illustrating the constitutive photomorphogenesis of ell 
seedling development in the dark. Wild-type (left) and ell (right) seedlings were 
grown in the dark for twenty-one days on Murashige-Skoog (MS) plates containing 
two percent sucrose. 

10 Figs. 3A-3B are photographs showing that the rosette leaves of the ell plant 

(Fig. 3B) are darker green in color that those of the wild-type plant (Fig. 3A). 

Fig. 4 is a photograph illustrating that an ell mutant has reduced apical 
dominance in comparison to a wild-type plant Six-week-old wild-type (left) and ten- 
week-old ell (right) plants were grown in the greenhouse. 

15 Fig. 5 is a photograph showing that ell mutants (right) exhibit irregular, 

thickened cotyledons and hypocotyls, and reduced cotyledon petioles compared to 
wild-type plants (left). 

Figs. 6A-6B are photographs showing abnormal flower development in the ell 
mutant. Fig. 6 A shows, from left to right, that the sepal, petal, stamen, and carpel are 

20 shorter in ell (lower row) than wild-type (upper row) plants. Fig. 6B shows, from left 
to right, the top and side view of wild-type (left) and ell (right) flowers. 

Figs. 7A-7F are photographs showing embryo development in ell and wild- 
type plants. Wild-type and ell plants are shown in the left and right of each 
photograph, respectively. Fig. 7A shows ell embryo development at the 32- to 64-ceIl 

25 stage, and Fig. 7B shows that, when wild-type embryos have reached the heart stage, 
ell embryos are only at the globular stage. As shown in Fig. 7C and Fig. 7D, when 
the wild-type embryo reached the torpedo stage, the ell mutant embryo was at the 
heart stage. Fig. 7E shows that apical hooks were not formed in ell embryos. And 
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Fig* 7F shows that ell seeds desiccated without completing the late stages of 
embryogenesis. 

Figs. 8A-8B are photographs of dry seeds from wild-type (Fig. 8A) and ell 
plants (Fig. 8B). Reduced seed size, wrinkled seed coat, and precocious germination 
were observed in ell seeds. 

Figs. 9A-9F are photographs showing the supernumerary cotyledons that were 
observed in the ell mutant, including one (Fig. 9A), two (Fig. 9B), three (Fig. 9C), 
four (Fig. 9D), five (Fig. 9E), and more than six cotyledons (Fig. 9F). 

Figs. 10A-10B are illustrations showing various aspects of the molecular 
characterization of the Arabidopsis thaliana C-14 sterol reductase gene. Fig. 10 A is a 
schematic illustration showing the position of a T-DNA insertion into chromosome 3 
of Arabidopsis^ approximately forty base pairs upstream of the ELL gene, and the 
exon-intron structure of the C-14 sterol reductase gene. 

Fig. 1 OB is a schematic illustration showing the map position of ELL on chromosome 
3. 

Fig. 1 1 is a schematic illustration showing a comparison of the predicated ELL 
amino acid sequence (designated Ath; SEQ ID NO: 1) with C-14 sterol reductase of 
Saccharomyces cerevisiae (Erg24) and Schizosaccharomyces pombe (Pombe), and C- 
24 sterol reductase of Sz. pombe (Stsl) and S. cerevisiae (Ygl022). 

Fig. 12 is a schematic illustration showing that the predicted ELL amino acid 
sequence (designated Ath; SEQ ID NO: 1) shares homology to human and chicken 
lamin B receptor. 

Figs. 13A-13B are photographs showing that the ell phenotype was not 
corrected by exogenous feeding of brassinolide (1 fuM) in either dark (Fig. 13 A) or 
light (Fig. 13B). From left to right in Figs. 13A-13B: wild-type; ell; wild-type + 
brassinolide; and ell + brassinolide. 
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Fig, 14 is a schematic illustration showing the nucleotide sequence of an 
Arabidopsis C-14 sterol reductase (SEQ ID NO: 2) and its deduced amino acid 
sequence polypeptide (SEQ ID NO: 1 ). 

Fig. 15 is a schematic illustration showing the genomic nucleotide sequence of 
5 an Arabidopsis C-14 sterol reductase polypeptide (SEQ ID NO: 3). 

Fig. 16 is a schematic illustration showing the sequence comparison between 
the genomic nucleotide sequence (SEQ ID NO: 3) and cDNA sequences (SEQ ID NO: 
2) of an Arabidopsis C-14 sterol reductase. 

10 There now follows a description of an Arabidopsis mutant, ell (extra long life), 

that displays a life span that is at least three times greater than wild-type plants. The 
ell mutant was isolated by T-DNA tagging methods and was shown to encode a novel 
C-14 sterol reductase. This example is provided for the purpose of illustrating the 
invention, and should not be construed as limiting. 

15 

Identification and Developmental Effects of the ell Mutation By screening for 
mutants displaying BR deficiency or constitutive cytokinin activity, a recessive 
mutation causing pleiotropic developmental effects was identified according to 
conventional methods in an Arabidopsis T-DNA insertional mutant collection 

20 (Feldmann, Plant J. 1:71, 1991; Errampalli et ah, Plant Cell 3: 149, 1991). This 
mutant, termed "ell", was found to have a number of developmental abnormalities. 

For example, unlike wild-type plants, ell mutants displayed constitutive light- 
morphogenesis (Fig. 2), similar to the Arabidopsis det2 (Chory et al., Plant Cell 3: 
445, 1991) and c/w/ (Szekeres et al., Cell 85: 171, 1996) mutants. In addition, 

25 compared to wild-type plants, ell plants had darker green rosette leaves (Figs. 3A-3B), 
reduced apical dominance (Fig. 4), stunted hairy roots, and irregular hypocotyl and 
cotyledons (Fig. 5). Furthermore, as shown in Figs. 6A-6B, the ell mutant showed 
reduced and ruffled sepals and petals. The ell mutant also showed delayed and altered 
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embryo development (Figs. 7A-7F) and was found to have reduced fertility, producing 
wrinkled seeds that precociously germinated (Figs. 8A-8B). In addition, the various 
phenotypes of ell overlapped with amp- J {ptl) (Chaudhury et. al., Plant J. 4: 907, 
1993) and hauptling (Jurgens et aL, Ann. Rev. Genet. 28: 351, 1994), including 
5 supernumerary cotyledons (Figs. 9A-9F). 

Finally, as shown in Figs. 1A-1B, the morphology of the T-DNA tagged ell 
mutant seedlings was phenocopied by treating wild-type seedlings with 30 \iM 
dimethylallylamine purine (2ip), a synthetic cytokinin. 

Despite having a number of developmental abnormalities, ell mutants were 

10 found to have a life span that was at least three times greater than wild-type plants. 
Genetic Analysis and Molecular Cloning of ELL 

Standard segregation analysis indicated that ell is a recessive mutation. The T2 
population of the transgenic line carrying the ell mutant showed a 3: 1 Mendelian 
segregation of the T-DNA using kanamycin resistance (kan r ) as a selectable marker. 

15 Of the kan r plants, thirty-three percent showed the ell phenotype, indicating that the ell 
mutation was recessive. A T3 population was then generated from selfed T2 kan r 
plants having the wild-type phenotype, and the kan r marker showed a 3: 1 segregation. 
Of the seventy-five percent displaying kan r , twenty-five percent showed the ell 
phenotype. Because ell homozygous plants were found to be either lethal or sterile, 

20 T2 heterozygous ell plants were subsequently backcrossed to wild-type plants for 
additional segregation analysis. The resulting Fl population from this backcross 
showed a 1 : 1 segregation of the kan r marker, no plants were observed having the ell 
phenotype. The Fl kan r individuals of the backcross were then selfed to produce an 
F2 population. Seventy-five percent of this F2 population was found to be kan r , and 

25 thirty-three percent of the kan T resistant plants showed the ell phenotype, confirming 
the recessive nature of this mutant. Consistent segregation of the ell phenotype and 
kan r marker was also observed in a subsequent backcross, further indicating that ell 
was tagged by the T-DNA. 
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Genomic DNA blot analysis, using an NPTII probe derived from the T-DNA 
vector, showed that a unique single copy of T-DNA was integrated into the ell 
genome. This result, together with the segregation data described above, further 
indicated that the ell phenotype was associated with the kan r marker, and that the ell 
5 mutation resulted from a single T-DNA insertion in the Arabidopsis genome. 

The T-DNA-tagged locus was then isolated by constructing a genomic DNA 
library from the ell mutant and was mapped by hybridization using the NPTII probe. 
Fig. 10A shows the physical map of the T-DNA tagged locus that was determined by 
DNA hybridization. One of three genomic clones that were found to hybridize to the 

10 NPTII probe was partially sequenced and found to have a complete T-DNA insertion 
and flanking plant sequences. A segment of this genomic clone containing both T- 
DNA and plant sequences was then used to screen a genomic library that was prepared 
from wild-type plants. Two positive clones that were identified in this screen were 
then sequenced. The genomic nucleotide sequence is presented in Fig. 15 (SEQ ID 

15 NO: 3). 

The T-DN A-plant DNA insert junctions were also used as probes to screen a 
cDNA library that was prepared from wild-type plants. One isolated cDNA clone, 
designated D13, was found to have a nucleotide sequence (SEQ ID NO: 1) that 
matched the genomic sequences flanking the right T-DNA border. Comparison of the 

20 cDNA (Fig. 14) with the genomic DNA sequence (Fig. 1 5) also revealed that the T- 
DNA was inserted at a location forty base pairs upstream of the 5' end of the ELL 
cDNA transcript (Fig. 16). The complete genomic fragment covering the cDNA 
sequence was composed of 14 exons and 13 introns (Fig. 10A). Probes that were 
prepared from both the cDNA or genomic clone were then used for DNA blot 

25 analysis. Results from this analysis confirmed that the ELL gene was of plant origin. 

We also determined the chromosomal position of ELL by standard segregation 
analysis of restriction fragment length polymorphisms (RFLPs) in recombinant inbred 
lines (Nam et al M Plant Cell 1 : 699, 1989; Lister and Dean, Plant J. 4: 745, 1993; 
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Hauge et ah, Plant J. 3: 745, 1993; Schmidt et al., Science 270: 480, 1995; Zachgo et 
aL, Genomic Res. 6: 19, 1996). By this analysis, we found that ELL is located on 
chromosome 3 and is flanked by the chromosomal markers by mi456 and g2778 (Fig. 
10B). 

ELL Encodes a Novel C-14 Sterol Reductase 

A comparison of the deduced polypeptide sequence of the full-length ELL 
cDNA clone to the GenBank database showed that ELL had 35% identity to C-14 
sterol reductase (Erg24) in yeast (Lorenz and Parks, DNA Cell Biol. 9: 685, 1992; Lai 
et al., Gene 140: 41, 1994) (Fig. 1 1) and 40% identity to the lamin B receptor (LBR) 
in humans (Ye and Worman, J. Biol Chem. 269: 1 1306, 1994) (Fig. 12). In addition, 
the amino acid sequence of ELL predicted several hydrophobic regions and between 
eight to nine transmembrane domains, consistent with the yeast Erg24 and human 
LBR. However, ELL was observed to lack a basic nucleoplasmic amino-terminal 
domain of about 200 amino acids that has been identified in human LBR. Database 
searches also revealed that at least two Arabidopsis expression sequence tagged (EST) 
clones (GenBank accession numbers T4501 1 and T42407) shared homology to ELL. 
DNA sequencing revealed that T4501 1 encodes an unknown gene with 60% 
nucleotide sequence identity to ELL. The predicted amino acid sequence of T4501 1 
was also observed to have greater than 50% identity to the yeast ERG24 and human 
LBR. These results further confirmed that ELL is encoded by a gene that is a member 
of the C-14 sterol reductase gene family. T42407 was found to encode an Arabidopsis 
sterol A7-reductase (Lecain et aL, J. Biol. Chem. 271: 10866, 1996) that shares 32% 
amino acid identity to ELL. 

RNA blot analysis indicated multiple transcripts hybridizing to the full-length 
ELL cDNA. 

To determine whether the ell mutant phenotype is corrected by exogenous 
feeding of brassinolide, we germinated ell seedlings on agar plates containing 1 \xM 
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brassinolide or 1 ^M 24-epibrassinolide (Li et a!., Science 272: 398, 1996). The 
results of these experiments showed that the presence of brassinolide or 24- 
epibrassinolide, in the growth medium of ell plants did not alter the mutant phenotype 
(Fig. 13A-13B). Thus, it appears that steroid compounds other than BRs are needed to 
5 restore an ell mutant to a normal growth and development phenotype, as reflected by 
the pleiotropic phenotypes such as stunted roots (Fig. 2) and impaired embryogenesis 
(Fig. 7A-7F). 

To confirm that ELL activity was upstream of DET2 in the sterol biosynthesis 
pathway, a double mutant between ell and detl was constructed and analyzed. The 
1 0 phenotype of det2/ell was indistinguishable from ell, further supporting the hypothesis 
that DET2 was epistatic to ELL. 

Isolation of Other C-14 Sterol Reductase cDNAs and Genomic DNAs 

Based on the C-14 sterol reductase genes and polypeptides described herein, 

15 the isolation of additional plant C-14 sterol reductase coding sequences is made 

possible using standard strategies and techniques that are well known in the art. For 
example, using all or a portion of the amino acid sequence of a C-14 sterol reductase 
polypeptide, one may readily design C-14 sterol reductase-specific oligonucleotide 
probes, including C-14 sterol reductase degenerate oligonucleotide probes (i.e., a 

20 mixture of all possible coding sequences for a given amino acid sequence). These 
oligonucleotides may be based upon the sequence of either DNA strand and any 
appropriate portion of the C-14 sterol reductase sequence (for example, Fig. 14; SEQ 
ID NOS: 2 and 1, respectively; and Fig. 15 (SEQ ID NO: 3). General methods for 
designing and preparing such probes are provided, for example, in Ausubel et al., 

25 1 996, Current Protocols in Molecular Biology, Wiley Interscience, New York, and 

Berger and Kimmel, Guide to Molecular Cloning Techniques, 1987, Academic Press, 
New York. These oligonucleotides are useful for C-14 sterol reductase gene isolation, 
either through their use as probes capable of hybridizing to C-14 sterol reductase 
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complementary sequences or as primers for various amplification techniques, for 
example, polymerase chain reaction (VCR) cloning strategies. 

Hybridization techniques and screening procedures are well known to those 
skilled in the art and are described, for example, in Ausubel et al. (supra); Berger and 
Kimmel (supra); and Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, New York. If desired, a combination of 
different oligonucleotide probes may be used for the screening of a recombinant DNA 
library. The oligonucleotides may be detectably-labeled using methods known in the 
art and used to probe filter replicas from a recombinant DNA library. Recombinant 
DNA libraries are prepared according to methods well known in the art, for example, 
as described in Ausubel et al. (supra), or they may be obtained from commercial 
sources. 

For detection or isolation of closely related C-14 sterol reductase sequences 
having greater than 80% identity, high stringency conditions are preferably used; such 
conditions include hybridization at about 65 °C and about 50% formamide, a first 
wash at about 65 °C, about 2X SSC, and 1% SDS, followed by a second wash at about 
65 °C and about 0.1% SDS, and 0.1X SSC. Lower stringency conditions for detecting 
C-14 sterol reductase genes having about 40-50% sequence identity to the C-14 sterol 
reductase genes described herein include, for example, hybridization at about 37 °C in 
the absence of formamide, a first wash at about 37°C, about 6X SSC, and about 1% 
SDS, and a second wash at about 37°C, about 6X SSC, and about 1% SDS. These 
stringency conditions are exemplary; other appropriate conditions may be determined 

by those skilled in the art. 

As discussed above, C-14 sterol reductase oligonucleotides may also be used as 
primers in amplification cloning strategies, for example, using PCR. PCR methods 
are well known in the art and are described, for example, in PCR Technology, Erlich, 
ed., Stockton Press, London, 1989; PCR Protocols: A Guide to Methods and 
Applications, Innis et al., eds., Academic Press, Inc., New York, 1990; and Ausubel et 
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al. (supra). Primers are optionally designed to allow cloning of the amplified product 
into a suitable vector, for example, by including appropriate restriction sites at the 5' 
and 3 1 ends of the amplified fragment (as described herein). If desired, C-14 sterol 
reductase sequences may be isolated using the PCR "RACE" technique, or Rapid 
5 Amplification of cDNA Ends (see, e.g., Innis et al. (supra)). By this method, 

oligonucleotide primers based on a C-14 sterol reductase sequence are oriented in the 
3 1 and 5' directions and are used to generate overlapping PCR fragments. These 
overlapping 3*- and 5'-end RACE products are combined to produce an intact full- 
length cDNA. This method is described in Innis et al. (supra); and Frohman et al., 
10 Proc. Natl. Acad. ScL USA 85: 8998, 1988. 

Alternatively, any plant cDNA expression library may be screened by 
functional complementation of a yeast C-14 reductase mutant (for example, the erg24 
mutant described by Lorenz and Parks, DNA Cell Biol. 9: 685, 1992) according to 
standard methods. 

1 5 Useful C-14 sterol reductase sequences may be isolated from any appropriate 

organism. Confirmation of a sequenced relatedness to the C-14 sterol reductase 
polypeptide family may be accomplished by a variety of conventional methods 
including, but not limited to, functional complementation assays and sequence 
comparison. In addition, the activity of any C-14 sterol reductase sequence may be 

20 evaluated according to any of the techniques described herein. 

C-14 Sterol Re ductase Polypeptide Expression 

C-14 sterol reductase polypeptides may be produced by transformation of a 
suitable host cell with all or part of a C-14 sterol reductase cDNA (for example, the 
25 cDNA described above) in a suitable expression vehicle or with a plasmid construct 
engineered for increasing the expression of a 
C-14 sterol reductase polypeptide (supra) in vivo. 
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Those skilled in the field of molecular biology will understand that any of a 
wide variety of expression systems may be used to provide the recombinant protein. 
The precise host cell used is not critical to the invention. The C-14 sterol reductase 
protein may be produced in a prokaryotic host, for example, E. coli, or in a eukaryotic 
5 host, for example, Saccharomyces cerevisiae, mammalian cells (for example, COS 1 
or NIH 3T3 cells), or any of a number of plant cells including, without limitation, 
algae, tree species, ornamental species, temperate fruit species, tropical fruit species, 
vegetable species, legume species, monocots, dicots, or in any plant of commercial or 
agricultural significance. Particular examples of suitable plant hosts include, but are 

10 not limited to, Conifers, Petunia, Tomato, Potato, Tobacco, Arabidopsis, Lettuce, 
Sunflower, Oilseed rape, Flax, Cotton, Sugarbeet, Celery, Soybean, Alfalfa, 
Medicago, Lotus, Vigna, Cucumber, Carrot, Eggplant, Cauliflower, Horseradish, 
Morning Glory, Poplar, Walnut, Apple, Asparagus, Rice, Maize, Millet, Onion, 
Barley, Orchard grass, Oat, Rye, and Wheat. 

15 Such cells are available from a wide range of sources including the American 

Type Culture Collection (Rockland, MD); or from any of a number seed companies, 
for example, W. Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. 
(Greenwood, SC), Johnny Seed Co. (Albion, ME), or Northrup King Seeds 
(Harstville, SC). Descriptions and sources of useful host cells are also found in Vasil 

20 I.K., Cell Culture and Somatic Cell Genetics of Plants , Vol I, II, III Laboratory 

Procedures and Their Applications Academic Press, New York, 1984; Dixon, R.A., 
Plant Cell Culture-A Practical Approach, IRL Press, Oxford University, 1985; Green 
et al., Plant Tissue and Cell Culture, Academic Press, New York, 1987; and Gasser 
and Fraley, Science 244: 1293, 1989. 

25 For prokaryotic expression, DNA encoding a C-14 sterol reductase polypeptide 

is carried on a vector operably linked to control signals capable of effecting expression 
in the prokaryotic host. If desired, the coding sequence may contain, at its 5' end, a 
sequence encoding any of the known signal sequences capable of effecting secretion 
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of the expressed protein into the periplasmic space of the host cell, thereby facilitating 
recovery of the protein and subsequent purification. Prokaryotes most frequently used 
are various strains of E. coli; however, other microbial strains may also be used. 
Plasmid vectors are used which contain replication origins, selectable markers, and 
5 control sequences derived from a species compatible with the microbial host. 

Examples of such vectors are found in Pouwels et ah (supra) or Ausubel et al. (supra). 
Commonly used prokaryotic control sequences (also referred to as "regulatory 
elements") are defined herein to include promoters for transcription initiation, 
optionally with an operator, along with ribosome binding site sequences. Promoters 

10 commonly used to direct protein expression include the beta-lactamase (penicillinase), 
the lactose (lac) (Chang et al., Nature 198: 1056, 1977), the tryptophan (Trp) (Goeddel 
et al., Nucl. Acids Res. 8: 4057, 1980), and the tac promoter systems, as well as the 
lambda-derived P L promoter and N-gene ribosome binding site (Simatake et al., 
Nature 292: 128, 1981). 

1 5 One particular bacterial expression system for C- 14 sterol reductase 

polypeptide production is the E. coli pET expression system (Novagen, Inc., Madison, 
WI). According to this expression system, DNA encoding a C-14 sterol reductase 
polypeptide is inserted into a pET vector in an orientation designed to allow 
expression. Since the C-14 sterol reductase gene is under the control of the T7 

20 regulatory signals, expression of C-14 sterol reductase is induced by inducing the 

expression of T7 RNA polymerase in the host cell. This is typically achieved using 
host strains which express T7 RNA polymerase in response to IPTG induction. Once 
produced, recombinant C-14 sterol reductase polypeptide is then isolated according to 
standard methods known in the art, for example, those described herein. 

25 Another bacterial expression system for C-14 sterol reductase polypeptide 

production is the pGEX expression system (Pharmacia). This system employs a GST 
gene fusion system which is designed for high-level expression of genes or gene 
fragments as fusion proteins with rapid purification and recovery of functional gene 



BNSDOCI D: <WO 9748793A 1 J_> 



WO 97/48793 ^ PCT/US97/10644 



-21- 



products. The protein of interest is fused to the carboxyl terminus of the glutathione 
S-transferase protein from Schistosoma japonicwn and is readily purified from 
bacterial lysates by affinity chromatography using Glutathione Sepharose 4B. Fusion 
proteins can be recovered under mild conditions by elution with glutathione. 
5 Cleavage of the glutathione S-transferase domain from the fusion protein is facilitated 
by the presence of recognition sites for site-specific proteases upstream of this 
domain. For example, proteins expressed in pGEX-2T plasmids may be cleaved with 
thrombin; those expressed in pGEX-3X may be cleaved with factor Xa. 

For eukaryotic expression, the method of transformation or transfection and the 

10 choice of vehicle for expression of the C-14 sterol reductase polypeptide will depend 
on the host system selected. Transformation and transfection methods are described, 
e.g., in Ausubel et al. (supra); Weissbach and Weissbach, Methods for Plant 
Molecular Biology, Academic Press, 1989; Gelvin et al., Plant Molecular Biology 
Manual, Kluwer Academic Publishers, 1990; Kindle, K., Proc. Natl. Acad. ScL, U.S.A 

15 87: 1228, 1990; Potrykus, L, Annu. Rev. Plant Physiol. Plant Mol. Biology 42: 205, 
1991; and BioRad (Hercules, CA) Technical Bulletin #1687 (Biolistic Particle 
Delivery Systems). Expression vehicles may be chosen from those provided, e.g., in 
Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al., 1985, Supp. 1987); 
Gasser and Fraley (supra); Clontech Molecular Biology Catalog (Catalog 1 992/93 

20 Tools for the Molecular Biologist, Palo Alto, CA); and the references cited above. 

Most preferably, an C-14 sterol reductase polypeptide is produced by a stably- 
transfected plant cell line, a transiently-transfected plant cell line, or by a transgenic 
plant. A number of vectors suitable for stable transfection of plant cells or for the 
establishment of transgenic plants are available to the public; such vectors are 

25 described in Pouwels et al. (supra), Weissbach and Weissbach (supra), and Gelvin et 
al. (supra). Methods for constructing such cell lines are described in, e.g., Weissbach 
and Weissbach (supra), and Gelvin et al. (supra). Typically, plant expression vectors 
include (1) a cloned plant gene under the transcriptional control of 5 f and 3* regulatory 
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sequences and (2) a dominant selectable marker. Such plant expression vectors may 
also contain, if desired, a promoter regulatory region (for example, one conferring 
inducible or constitutive, environmentally- or developmentally-regulated, or cell- or 
tissue-specific expression), a transcription initiation start site, a ribosome binding site, 
5 an RNA processing signal, a transcription termination site, and/or a polyadenylation 
signal. 

Alternatively, the C-14 sterol reductase polypeptide may be produced using a 
transient expression system (e.g., the maize transient expression system described by 
Sheen, Plant Cell 2: 1027, 1990). 
10 Once the desired C-14 sterol reductase nucleic acid sequences is obtained, it 

may be manipulated in a variety of ways known in the art. For example, where the 
sequence involves non-coding flanking regions, the flanking regions may be subjected 
to mutagenesis. 

The C-14 sterol reductase DNA sequence of the invention may, if desired, be 
15 combined with other DNA sequences in a variety of ways. The C-14 sterol reductase 
DNA sequence of the invention may be employed with all or part of the gene 
sequences normally associated with the C-14 sterol reductase protein. In its 
component parts, a DNA sequence encoding a C-14 sterol reductase protein is 
combined in a DNA construct having a transcription initiation control region capable 
20 of promoting transcription and translation in a host cell. 

In general, the constructs will involve regulatory regions functional in plants 
which provide for modified production of C-14 sterol reductase protein as discussed 
herein. The open reading frame coding for the C-14 sterol reductase protein or 
functional fragment thereof will be joined at its 5' end to a transcription initiation 
25 regulatory region such as the sequence naturally found in the 5 f upstream region of the 
C-14 sterol reductase structural gene. Numerous other transcription initiation regions 
are available which provide for constitutive or inducible regulation. 
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For applications where developmental, cell, tissue, hormonal, or environmental 
expression is desired, appropriate 5 1 upstream non-coding regions are obtained from 
other genes, for example, from genes regulated during meristem development, seed 
development, embryo development, or leaf development. 
5 Regulatory transcript termination regions may also be provided in DNA 

constructs of this invention as well. Transcript termination regions may be provided 
by the DNA sequence encoding the C-14 sterol reductase protein or any convenient 
transcription termination region derived from a different gene source. The transcript 
termination region will contain preferably at least 1-3 kb of sequence 3* to the 

10 structural gene from which the termination region is derived. Plant expression 
constructs having C-14 sterol reductase as the DNA sequence of interest for 
expression (in either the sense or antisense orientation) may be employed with a wide 
variety of plant life, particularly plant life involved in the production of storage 
reserves (for example, those involving carbon and nitrogen metabolism). Such 

15 genetically-engineered plants are useful for a variety of industrial and agricultural 
applications as discussed below. Importantly, this invention is applicable to 
dicotyledons and monocotyledons, and will be readily applicable to any new or 
improved transformation or regeneration method. 

An example of a useful plant promoter according to the invention is a 

20 caulimovirus promoter, for example, a cauliflower mosaic virus (CaMV) promoter. 

These promoters confer high levels of expression in most plant tissues, and the activity 
of these promoters is not dependent on virally encoded proteins. CaMV is a source for 
both the 35S and 19S promoters. In most tissues of transgenic plants, the CaMV 35S 
promoter is a strong promoter (see, e.g., Odell et al., Nature 313: 810 1985). The 

25 CaMV promoter is also highly active in monocots (see, e.g., Dekeyser et al., Plant 
Cell 2: 591, 1990; Terada and Shimamoto, MoL Gen. Genet 220: 389, 1990). 
Moreover, activity of this promoter can be further increased (i.e., between 2-10 fold) 
by duplication of the CaMV 35S promoter (see e.g., Kay et al., Science 236: 1299, 
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1987; Ow et al., Proc. Natl. Acad. Sci., U.S.A. 84: 4870, 1987; and Fang et ah, Plant 
Cell 1 : 141, 1989). 

Other useful plant promoters include, without limitation, the nopaline synthase 
promoter (An et al., Plant Physiol. 88: 547, 1988) and the octopine synthase promoter 
5 (Fromm et al., Plant Cell 1 : 977, 1 989). 

For certain applications, it may be desirable to produce the C-14 sterol 
reductase gene product in an appropriate tissue, at an appropriate level, or at an 
appropriate developmental time. For this purpose, there are an assortment of gene 
promoters, each with its own distinct characteristics embodied in its regulatory 

10 sequences, shown to be regulated in response to the environment, hormones, and/or 
developmental cues. These include gene promoters that are responsible for heat- 
regulated gene expression (see, e.g., Callis et al., Plant Physiol. 88: 965, 1988; 
Takahashi and Komeda, MoL Gen. Genet. 219: 365, 1989; and Takahashi et al. Plant 
J. 2: 751, 1992), light-regulated gene expression (e.g., the pea rbcSSA described by 

1 5 Kuhlemeier et al., Plant Cell 1: 471, 1 989; the maize rbcS promoter described by 

Schaffher and Sheen, Plant Cell 3: 997, 1991 ; or the cholorphyll a/b-binding protein 
gene found in pea described by Simpson et al., EMBO J. 4: 2723, 1985), hormone- 
regulated gene expression (for example, the abscisic acid (ABA) responsive sequences 
from the Em gene of wheat described by Marcotte et al., Plant Cell 1 : 969, 1989; the 

20 ABA- inducible HVA1 and HVA22, and rd29A promoters described for barley and 
Arabidopsis by Straub et al., Plant Cell 6: 617, 1994, Shen et al., Plant Cell 7: 295, 
1995; and wound- induced gene expression (for example, of wunl described by 
Siebertz et al., Plant Cell 1: 961, 1989), or organ-specific gene expression (for 
example, of the tuber-specific storage protein gene described by Roshal et al., EMBO 

25 J. 6: 1 155, 1987; the 23-kDa zein gene from maize described by Schernthaner et al., 
EMBO J. 7: 1249, 1988; or the French bean fi-phaseolin gene described by Bustos et 
al., Plant Cell 1: 839, 1989). 
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Plant expression vectors may also optionally include RNA processing signals, 
e.g, introns, which have been shown to be important for efficient RNA synthesis and 
accumulation (Callis et aL, Genes and Dev. 1 : 1 183, 1987). The location of the RNA 
splice sequences can dramatically influence the level of transgene expression in plants. 
5 In view of this fact, an intron may be positioned upstream or downstream of a C- 14 

sterol reductase polypeptide-encoding sequence in the transgene to modulate levels of 
gene expression. 

In addition to the aforementioned 5 1 regulatory control sequences, the 
expression vectors may also include regulatory control regions which are generally 

10 present in the 3' regions of plant genes (Thornburg et aL, Proc. Natl. Acad. Set U.S.A. 
84: 744, 1987; An et aL, Plant Cell 1:115, 1989). For example, the 3' terminator 
region may be included in the expression vector to increase stability of the mRNA. 
One such terminator region may be derived from the PI-II terminator region of potato. 
In addition, other commonly used terminators are derived from the octopine or 

15 nopaline synthase signals. 

The plant expression vector also typically contains a dominant selectable 
marker gene used to identify those cells that have become transformed. Useful 
selectable genes for plant systems include genes encoding antibiotic resistance genes, 
for example, those encoding resistance to hygromycin, kanamycin, bleomycin, G418, 

20 streptomycin, or spectinomycin. Genes required for photosynthesis may also be used 
as selectable markers in photosynthetic-deficient strains. Alternatively, the green- 
fluorescent protein from the jellyfish Aequorea victoria may be used as a selectable 
marker (Sheen et al., Plant J. 8:777, 1995; Chiu et aL, Current Biology 6: 325, 1996). 
Finally, genes encoding herbicide resistance may be used as selectable markers; useful 

25 herbicide resistance genes include the bar gene encoding the enzyme phosphinothricin 
acetyltransferase and conferring resistance to the broad spectrum herbicide Basta® 
(Hoechst AG, Frankfurt, Germany). 
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Efficient use of selectable markers is facilitated by a determination of the 
susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed 
cells. Some useful concentrations of antibiotics for tobacco transformation include, 
5 e.g., 75-100 p.g/ml (kanamycin), 20-50 jig/ml (hygromycin), or 5-10 ^ig/ml 

(bleomycin). A useful strategy for selection of transformants for herbicide resistance 
is described, e.g., by Vasil et al., supra. 

It should be readily apparent to one skilled in the art of molecular biology, 
especially in the field of plant molecular biology, that the level of gene expression is 
10 dependent, not only on the combination of promoters, RNA processing signals, and 

terminator elements, but also on how these elements are used to increase the levels of 
selectable marker gene expression. 

Plant Transformation 

15 Upon construction of the plant expression vector, several standard methods are 

available for introduction of the vector into a plant host, thereby generating a 
transgenic plant. These methods include ( 1 ) Agrobacterium-mediated transformation 
{A. tumefaciens ox A. rhizogenes) (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, vol 6, PWJ Rigby, ed, London, Academic Press, 1987; and Lichtenstein, 

20 CP., and Draper, J,. In: DNA Cloning, Vol II, D.M. Glover, ed, Oxford, IRI Press, 
1985)), (2) the particle delivery system (see, e.g., Gordon-Kamm et al., Plant Cell 2: 
603 (1990); or BioRad Technical Bulletin 1687, supra), (3) microinjection protocols 
(see, e.g., Green et al., supra), (4) polyethylene glycol (PEG) procedures (see, e.g., 
Draper et al., Plant Cell Physiol. 23: 451, 1982; or e.g., Zhang and Wu, Theor. AppL 

25 Genet. 76: 835, 1988), (5) liposome-mediated DNA uptake (see, e.g., Freeman et al., 
Plant Cell Physiol 25: 1353, 1984), (6) electroporation protocols (see, e.g., Gelvin et 
al., supra; Dekeyser et al., supra; Fromm et al., Nature 319: 791, 1986; Sheen Plant 
Cell 2: 1027, 1990; or Jang and Sheen Plant Cell 6: 1665, 1994), and (7) the vortexing 
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method (see, e.g., Kindle supra). The method of transformation is not critical to the 
invention. Any method which provides for efficient transformation may be employed. 
As newer methods are available to transform crops or other host cells, they may be 
directly applied. 

The following is an example outlining one particular technique, an 
Agrobacterium-mediated plant transformation. By this technique, the general process 
for manipulating genes to be transferred into the genome of plant cells is carried out in 
two phases. First, cloning and DNA modification steps are carried out in E. coli, and 
the plasmid containing the gene construct of interest is transferred by conjugation or 
electroporation into Agrobacterium. Second, the resulting Agrobacterium strain is 
used to transform plant cells. Thus, for the generalized plant expression vector, the 
plasmid contains an origin of replication that allows it to replicate in Agrobacterium 
and a high copy number origin of replication functional in E. coli. This permits facile 
production and testing of transgenes in E. coli prior to transfer to Agrobacterium for 
subsequent introduction into plants. Resistance genes can be carried on the vector, 
one for selection in bacteria, for example, streptomycin, and another that will function 
in plants, for example, a gene encoding kanamycin resistance or herbicide resistance. 
Also present on the vector are restriction endonuclease sites for the addition of one or 
more transgenes and directional T-DNA border sequences which, when recognized by 
the transfer functions of Agrobacterium, delimit the DNA region that will be 

transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell 
tungsten microprojectiles on which cloned DNA is precipitated. In the Biolistic 
Apparatus (Bio-Rad) used for the shooting, a gunpowder charge (22 caliber Power 
Piston Tool Charge) or an air-driven blast drives a plastic macroprojectile through a 
gun barrel. An aliquot of a suspension of tungsten particles on which DNA has been 
precipitated is placed on the front of the plastic macroprojectile. The latter is fired at 
an acrylic stopping plate that has a hole through it that is too small for the 
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macroprojectile to pass through. As a result, the plastic macroprojectile smashes 
against the stopping plate, and the tungsten microprojectiles continue toward their 
target through the hole in the plate. For the instant invention the target can be any 
plant cell, tissue, seed, or embryo. The DNA introduced into the cell on the 
5 microprojectiles becomes integrated into either the nucleus or the chloroplast. 

In general, transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene 
expression studies in plants and to produce improved plant varieties of agricultural or 
commercial interest. 

10 

Transgenic Plant Regeneration 

Plant cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant 
tissue culture techniques. It is well known in the art that various cells, tissues, and 

15 organs from almost any plant can be successfully cultured to regenerate an entire 
plant; such techniques are described, e.g., in Vasil supra', Green et al., supra; 
Weissbach and Weissbach, supra; and Gelvin et al., supra. 

In one particular example, a cloned C- 14 sterol reductase polypeptide or an 
antisense construct under the control of the 35S CaMV promoter and the nopaline 

20 synthase terminator and carrying a selectable marker (for example, kanamycin 
resistance) is transformed into Agro bacterium. Transformation of leaf discs (for 
example, of tobacco leaf discs), with vector-containing Agrobacterium is carried out 
as described by Horsch et al. (Science 221 \ 1229, 1985). Putative transformants are 
selected after a few weeks (for example, 3 to 5 weeks) on plant tissue culture media 

25 containing kanamycin (e.g. 100^g/ml). Kanamycin-resistant shoots are then placed 
on plant tissue culture media without hormones for root initiation. Kanamycin- 
resistant plants are then selected for greenhouse growth. If desired, seeds from self- 
fertilized transgenic plants can then be sowed in a soil-less medium and grown in a 
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greenhouse. Kanamycin-resistant progeny are selected by sowing surfaced sterilized 
seeds on hormone-free kanamycin-containing media. Analysis for the integration of 
the transgene is accomplished by standard techniques (see, for example, Ausubel et al. 

supra; Gelvin et al. supra). 
5 Transgenic plants expressing the selectable marker are then screened for 

transmission of the transgene DNA by standard immunoblot and DNA detection 
techniques. Each positive transgenic plant and its transgenic progeny are unique in 
comparison to other transgenic plants established with the same transgene. Integration 
of the transgene DNA into the plant genomic DNA is in most cases random, and the 
10 site of integration can profoundly affect the levels and the tissue and developmental 
patterns of transgene expression. Consequently, a number of transgenic lines are 
usually screened for each transgene to identify and select plants with the most 
appropriate expression profiles. 

Transgenic lines are evaluated for levels of transgene expression. Expression at 
15 the RNA level is determined initially to identify and quantitate expression-positive 
plants. Standard techniques for RNA analysis are employed and include PCR 
amplification assays using oligonucleotide primers designed to amplify only transgene 
RNA templates and solution hybridization assays using transgene-specific probes (see, 
e.g., Ausubel et al., supra). The RNA-positive plants are then analyzed for protein 
20 expression by Western immunoblot analysis using C-14 sterol reductase specific 
antibodies (see, e.g., Ausubel et aL, supra). In addition, in situ hybridization and 
immunocytochemistry according to standard protocols can be done using transgene- 
specific nucleotide probes and antibodies, respectively, to localize sites of expression 

within transgenic tissue. 
25 Once the recombinant C-14 sterol reductase protein is expressed in any cell or 

in a transgenic plant (for example, as described above), it may be isolated, e.g., using 
affinity chromatography. In one example, an anti-C14 sterol reductase antibody (e.g., 
produced as described in Ausubel et al., supra, or by any standard technique) may be 
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attached to a column and used to isolate the polypeptide. Lysis and fractionation of C- 
14 sterol reductase-producing cells prior to affinity chromatography may be performed 
by standard methods (see, e.g., Ausubel et ah, supra). Once isolated, the recombinant 
protein can, if desired, be further purified, for example, by high performance liquid 
5 chromatography (see, e.g., Fisher, Laboratory Techniques In Biochemistry And 
Molecular Biology, eds., Work and Burdon, Elsevier, 1980). 

Polypeptides of the invention, particularly short C-14 sterol reductase protein 
fragments, can also be produced by chemical synthesis (e.g., by the methods described 
in Solid Phase Peptide Synthesis, 2nd ed., 1984 The Pierce Chemical Co., Rockford, 
10 IL). 

These general techniques of polypeptide expression and purification can also be 
used to produce and isolate useful C-14 sterol reductase fragments or analogs. 

Antibodies 

15 C-14 sterol reductases described herein (or immunogenic fragments or analogs) 

may be used to raise antibodies useful in the invention; such polypeptides may be 
produced by recombinant or peptide synthetic techniques (see, e.g., Solid Phase 
Peptide Synthesis, 2nd ed., 1984, Pierce Chemical Co., Rockford, IL; Ausubel et al., 
supra). The peptides may be coupled to a carrier protein, such as KLH as described in 

20 Ausubel et al, supra. The KLH-peptide is mixed with Freund's adjuvant and injected 
into guinea pigs, rats, or preferably rabbits. Antibodies may be purified by peptide 
antigen affinity chromatography. 

Monoclonal antibodies may be prepared using the C-14 sterol reductase 
polypeptides described above and standard hybridoma technology (see, e.g., Kohler et 

25 aL, Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:51 1, 1976; Kohler et al., 
Eur. J. Immunol 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T 
Cell Hybridomas, Elsevier, NY, 1981; Ausubel et al., supra). 
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Once produced, polyclonal or monoclonal antibodies are tested for specific C- 
14 sterol reductase recognition by Western blot or immunoprecipitation analysis (by 
the methods described in Ausubel et al., supra). Antibodies which specifically 
recognize C-14 sterol reductases are considered to be useful in the invention; such 
5 antibodies may be used, e.g., in an immunoassay to monitor the level of C-14 sterol 
reductase produced by a plant. 

Because the present invention provides for the genetic manipulation of a plant 

10 sterol biosynthetic pathway, this invention described is useful for a variety of 

agricultural and commercial purposes including, but not limited to, increasing crop 
yields, improving crop and ornamental quality, and reducing agricultural production 
costs. For example, the methods, DNA constructs, proteins, and transgenic plants 
described herein are useful for improving a number of fruit and vegetable 

1 5 characteristics including, but not limited to, texture, size, nutritional content, 
modification of sterol composition, disease and insect resistance, and ripening 
processes. In addition, genetic manipulation of plant sterol composition (for example, 
seed sterol composition) is useful for improving food quality and oil stability, and 
regulating the formation of compounds having anti-nutritional properties. 

20 In one particular example, antisense C-14 sterol reductase sequences are useful 

for reducing the expression of C-14 sterol reductase expression in a transgenic plant. 
Such reduced expression of C- 1 4 sterol reductase provides a means for increasing the 
life-span of such plants. Increased life-span extends reproductive period, delays 
senescence, and increases branch number for high productivity and yield. In addition, 

25 transgenic plants expressing antisense C-14 sterol reductase are useful for producing 
plants having reduced and more compact proportions. Such plants require less space 
and land requirements for their growth, and are more convenient and efficient to 
harvest. 
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Overproduction of the C-14 sterol reductase in transgenic plants is useful for 
enhancing the production of steroid compounds having a variety of medicinal or 
agricultural applications. For example, overproduction of mammalian steroid 
hormones in plants offers an inexpensive means for producing such hormones. 
5 In addition, C-14 sterol reductase polypeptides disclosed herein are useful for 

the development of enzyme inhibitors of the sterol biosynthetic pathway. 

Other Embodiments 

In other embodiments, the invention includes any protein which is substantially 

10 identical to a crucifer C-14 sterol reductase polypeptide (Fig. 10; SEQ ID NO:l); such 
homologs include other substantially pure naturally-occurring plant C- 1 4 sterol 
reductase proteins as well as allelic variants; natural mutants; induced mutants; 
proteins encoded by DNA that hybridizes to the C-14 sterol reductase DNA sequence 
of Fig. 14 (SEQ ID NO: 2) under high stringency conditions or, less preferably, under 

15 low stringency conditions (e.g., washing at 2X SSC at 37°C with a probe length of at 
least 10-15 nucleotides), both as described herein; and proteins specifically bound by 
antisera directed to a C-14 sterol reductase polypeptide. The term also includes 
chimeric polypeptides that include a C-14 sterol reductase portion. 

The invention further includes analogs of any naturally-occurring plant C-14 

20 sterol reductase polypeptide. Analogs can differ from the naturally-occurring C-14 
sterol reductase protein by amino acid sequence differences, by post-translational 
modifications, or by both. Analogs of the invention will generally exhibit at least 
30%, more preferably 40%, and most preferably 50% or even 80-95% identity with all 
or part of a naturally-occurring plant C-14 sterol reductase amino acid sequence. The 

25 length of sequence comparison is at least 15 amino acid residues, preferably at least 25 
amino acid residues, and more preferably more than 35 amino acid residues. 
Modifications include in vivo and in vitro chemical derivatization of polypeptides, 
e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications 
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may occur during polypeptide synthesis or processing or following treatment with 
isolated modifying enzymes. Analogs can also differ from the naturally-occurring C- 
14 sterol reductase polypeptide by alterations in primary sequence. These include 
genetic variants, both natural and induced (for example, resulting from random 
5 mutagenesis by irradiation or exposure to ethanemethylsulfate or by site-specific 

mutagenesis as described in Sambrook, Fritsch and Maniatis, Molecular Cloning: A 
Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et aL, supra). Also 
included are cyclized peptides, molecules, and analogs which contain residues other 
than L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino 

10 acids, e.g., P or y amino acids. 

In addition to full-length polypeptides, the invention also includes C-14 sterol 
reductase polypeptide fragments. As used herein, the term "fragment," means at least 
20 contiguous amino acids, preferably at least 30 contiguous amino acids, more 
preferably at least 50 contiguous amino acids, and most preferably at least 60 to 80 or 

15 more contiguous amino acids. Fragments of C-14 sterol reductase polypeptides can be 
generated by methods known to those skilled in the art or may result from normal 
protein processing (e.g., removal of amino acids from the nascent polypeptide that are 
not required for biological activity or removal of amino acids by alternative mRN A 
splicing or alternative protein processing events). 

20 Furthermore, the invention includes nucleotide sequences that facilitate specific 

detection of a C-14 sterol reductase nucleic acid. Thus, C-14 sterol reductase 
sequences described herein (e.g., SEQ ID NO: 2 and 3) or portions thereof may be 
used as probes to hybridize to nucleotide sequences from other plants (e.g., dicots, 
monocots, gymnosperms, and algae) by standard hybridization techniques under 

25 conventional conditions. Sequences that hybridize to a C-14 sterol reductase coding 
sequence or its complement and that encode a C-14 sterol reductase are considered 
useful in the invention. As used herein, the term "fragment," as applied to nucleic acid 
sequences, means at least 5 contiguous nucleotides, preferably at least 10 contiguous 
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nucleotides, more preferably at least 20 to 30 contiguous nucleotides, and most 
preferably at least 40 to 80 or more contiguous nucleotides. Fragments of C- 14 sterol 
reductase nucleic acid sequences can be generated by methods known to those skilled 
in the art. 

5 All publications and patent applications mentioned in this specification are 

herein incorporated by reference to the same extent as if each independent publication 
or patent application was specifically and individually indicated to be incorporated by 
reference. 

Other embodiments are within the following claims. 

0 

What is claimed is: 
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Claims 



1 . A substantially pure plant C- 1 4 sterol reductase polypeptide. 



5 2. A substantially pure polypeptide comprising an amino acid sequence 

substantially identical to the sequence shown in Fig. 14 (SEQ ID NO: 1). 



3. The polypeptide of claim 1 or 2, wherein said polypeptide comprises the 
amino acid sequence shown in Fig. 14 (SEQ ID NO: 1). 



4. The polypeptide of claim 1 or 2, wherein said polypeptide is from a dicot 



5. The polypeptide of claim 4, wherein said dicot is a crucifer. 



15 6. A purified DNA encoding a C-14 sterol reductase polypeptide. 



7. A purified DNA comprising a sequence substantially identical to the DNA 
sequence shown in Fig. 14 (SEQ ID NO: 2). 



20 8. The purified DNA of claim 6 or 7, wherein said DNA comprises the 

sequence shown in Fig. 14 (SEQ ID NO: 2). 



9. The purified DNA of claim 6 or 7, wherein said DNA encodes a polypeptide 
which has an amino acid sequence substantially identical to that shown in Fig. 14 
25 (SEQ ID NO: 1). 



10. The purified DNA of claim 9, wherein said DNA encodes a polypeptide 
which has the amino acid sequence shown in Fig. 14 (SEQ ID NO: 1). 
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1 1 . The purified DNA of claim 6 or 7, wherein said DNA is from a dicot. 



12. The purified DNA of claim 1 1, wherein said dicot is a crucifer. 
5 1 3. A vector comprising the purified DNA of claim 6 or 7. 

14. A cell comprising the purified DNA of claim 6 or 7. 

15. A method of producing a recombinant C-14 sterol reductase polypeptide 
10 comprising 

providing a cell transformed with purified DNA encoding a C-14 sterol 
reductase polypeptide positioned for expression in said cell, and 

culturing said transformed cell under conditions for expressing said DNA. 

15 16. The method of claim 15, wherein said method further comprises recovering 

said recombinant C-14 sterol reductase polypeptide. 

17. The method of claim 15, wherein said cell is a plant cell. 

20 1 8. A recombinant C-14 sterol reductase produced by the method of claim 15. 

19. An isolated antibody which specifically recognizes and binds a plant C-14 
sterol reductase polypeptide. 

25 20. A transgenic plant which contains DNA encoding a C-l 4 sterol reductase 

polypeptide integrated into the genome of said plant, wherein said DNA is expressed 
in said transgenic plant. 
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2 1 . A transgenic plant which contains DN A encoding an amino acid sequence 
substantially identical to the sequence shown in Fig. 14 (SEQ ID NO: 1) 
integrated into the genome of said plant, wherein said DNA is expressed in said 
transgenic plant. 



22. A seed from a transgenic plant of claim 20 or 21 . 



23. A cell from a transgenic plant of claim 20 or 2 1 . 



10 24. A method of detecting a C-14 sterol reductase gene in a plant cell 

comprising: 

contacting the purified DNA of claim 6 or 7 or a portion thereof greater than 
about 12 nucleotides in length with a preparation of genomic DNA from said plant cell 
under hybridization conditions providing detection of DNA sequences having about 
15 40% or greater sequence identity to SEQ ID NO: 2. 



25. A method of isolating a C-14 sterol reductase gene or portion thereof, said 
method comprising 

(a) amplifying said C-14 sterol reductase gene using oligonucleotide primers, 
20 wherein said primers each have regions of complementarity to opposite DNA strands 

in a region of SEQ ID NO: 2; and 

(b) isolating said C-14 sterol reductase gene or portion thereof. 



26. A method for reducing the level of a C-14 sterol reductase polypeptide in a 
25 transgenic plant cell, said method comprising expressing in a plant cell an antisense C- 
14 sterol reductase nucleic acid sequence. 
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27. The method of claim 26, wherein said antisense C-14 sterol reductase 
nucleic acid sequence is encoded by a transgene integrated into the genome of said 
transgenic plant cell. 

5 28. The method of claim 26, wherein said C-14 sterol reductase sequence is 

SEQ ID NO: 2 or SEQ ID NO: 3. 

29. The method of claim 26, wherein said method further comprises growing a 
transgenic plant from said transgenic plant cell, whereby the level of the C-14 sterol 

10 reductase polypeptide is reduced in said transgenic plant. 

30. A method for increasing the level of a C-14 sterol reductase in a transgenic 
plant cell, comprising expressing in said transgenic plant cell a nucleic acid sequence 
encoding a polypeptide substantially identical to SEQ ID NO: 1 . 

15 

31. A transgenic plant comprising purified DNA encoding a plant C-14 sterol 
reductase polypeptide, said DNA comprising a knockout mutation in said C- 1 4 sterol 
reductase sequence. 
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Ell.Tmp6 Length: 6588 May 3, 1996 10:36 Type; N Check: 7899 

1 TNTTGAAGGN TNAAGAAAAA NTANGGTAAG CTGGGNAGGA CAAGANTTCT 

51 TGTNACCACA ACACAACAAC GCCATGAACC NATCGGTTTC TTNTGTTTNG 

101 AGATCACCTT TCTTGAGTTG GTGGTTTCTG AGMTCAAGNT CCTTGTTGAC 

151 TCAGTGAAGT CC AG ATGC AG CNTCAAAACT TTTCTCCTGT AGACNTAGCA 

201 AGAGTAACAG CACCAACCAA ATCGCTATCC GATGTAATCA AAAC CTTATC 

251 ACNTTCATCG TCCTCATATA TAATCTGAGG CCGTTGTTCC ACATTGTTAT 

301 TGTCGCTGCC AATTCTTTCC ATCACAATAC CCATCAGCTN TTCGAGGTTT 

3 51 TCAGCTCCAG AAGTAAACCG ATGTACACGG CCCTTAAGGT CTTCAAATTT 

401 GAACGAAAAC GAATTCCCTA GTCCTAGAGA TGGGTAAGAA CTGAGCTTCC 

451 CTATATCTGA ATGATGCATC ATTGCCGACA TTTCACTTTG AGTGTCAGAA 

501 TCATCAGGTG GCTCTAACGC AAGAGCTGAA TCCCAAAATT TCTGCATCAT 

551 CGTGTTTGCC ATATCATTTA CAGCTCCAGA ACTGTTCTCC ACC ATTG AAA 

601 TAGCTGCGTG AGTAATCTGA AGAACGTCTA CACAAGCTGC AGCTGATCCA 

651 TCTTTATCTA TAATTGGAAG ATGTAGAAAC TTTCCATCAT GCATTGTATG 

701 CAATGCATCC AGAATCGTTG TCTCTAGCGA TGCACATTCA GGATTCGGTG 

751 TCATTACCTT CTCGACAAGA GTCAATTCAG GAGATAAATT TTGTGCCACC 

801 ACTCGCATCA GAATGTCCTT TGAAGTCAAG ATTCCACTGA TTTTGTTCCC 

851 CCGTGGAAAT G ATT AC AG AG TTAACCCGCA AATCCCTCAT CCTTTTCGCA 

901 GCAACTGAAA CAGGATCTGA TCGTGCTACA AGTGCAACCT TCGATGTCTG 

951 7AATAATCGT GACAAGGCGG GTT AAAC ATT NTNTCCTTCA AGGTTTCAAT 

1051 CCACTGCAGC AGCTAAAGCA CTACCTTGCT CTGCAGTTTC TCCATCCTAG 
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1151 CCATTCTCCA C AAC AGGC AA GTGTCTAAAC TTCCCTTGAA CCATCTTCTG 

12 01 AAGAGCCTCA AGCGCCAACG AATCAGAAGT AACAAAAATA GGATTCCTAG 

12 51 TCATAACCTT AGAGACCAAA CTTTGATCCG GTCTCAACCC TTCAGCAATC 

13 01 ACTCTTGTAG CTACATCTTT ATCAGTAACA A TCCCGG AAA GAAGCGCACT 
13 51 TGAATCAGTC AACAAACAAG CATCAACACG CCTAGCAGCC ATTNTTCGAC 
1401 AAGCATCGAA AACACTAGTT CCTTCNAGGA ATAGTAAGAG CTTTCGATAA 
1451 CCTAAGCTTC TTCGCTGTCT CTCTCCATTA GAAGGAGCTT GAGATTGAGG 
1501 TTGAGGAGGA GGTGAATTGG GTTTTGAGGT GTTCCCANTA ACACTTCCAT 
1551 TCTCTGATTG TACTGGTTTC TTAGAAGGTG GTGGTCCTCN CCGTACAGTA 
160 1 GAATTGCTTC TCCTCCCTGA TGTTGAAGAA GGACCCGTCG CTTGAGTACT 
1651 CATATTCGGT CAATCTAGGG TTTACTTAGA TCCTAAATCC GTCANAAATG 
17 01 ATTCCTTTAG AT ATC AAACT CGTCTCTGCA AATGAAAAAT TCAACCTTTA 
17 51 ATTCACAAAC TATTGAAATT TCATCTAAAG CACGAATCTG AATAAAACCC 
1301 AATTCACAAT AAAGACGATT TGCTCTGAGA ATACGATGCA ACATACACGA 
1851 AAAGGATTCG AATTTAACGG ACGAGGGAAA TGAAACAACT TGAAACCCTA 
19 01 AGGATTTGAG C AG AAGTT AT GTGGGAAGAT TGGGNATTTA GGGTTTACCT 

19 51 TCTTCTTTCT TCNTCAAGGT CTCTCTCTCG AGCACTTTCG TTNCCCCAAA 

20 01 AACNAACGGC TCTTAACAAT TGAGTTAANC C ANTTATCG A GTTTTCATTG 
2 051 GNTGTTCCTG TTTCCGCGTG TGTGGTGGNT CUCC^CCTCC TTTCTTATAA 
2101 TCNACGACTA AAAATGTTAA ANATAANACT AANATTTCTT TC T ANAAAAA 
2151 TCGTAAAANC CAAATGTTTT TTTTTTTCTG ATAAATGTCT AT AAATC AC C 
2201 CTTTCTTTTT AAATAATGAA ATTTGATGAC ATTTATCTCT TGTATCTAGN 
2251 AGAGTTAATG GCTAACATAA ANACCAAAAA AAATTAATTC NAATAAATAT 
23 01 GATTTGTGTG GGTTACATGG AAAAATTGTC AAATAATAAA NCAAAAAAAA 
22 51 ATTGT AT AG A TGCAGTGCAA GTTGTTTCTG GTCAACTTGC CGTCGAGCCT 
2401 CACAACTGTT TGTTACAAGT GGACTCGCAT GTAATTCCCT CTTTTAATAA 

2 501 3AAATGTGAT CGGTGCAGAT TTTTCTATGT ACGTTTAAGC CTTTAAGGTA 

2 ~z t> 1 -»ACGTTTAAT CCNAA-«-*\TAT. CCCTGAATAA wAACACCGAT TAATGGAACC 

2651 
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27 01 GAACCATTAC GGCACCCTTA TATAATGGCA GCATCTTAAT ACT AAAC AAA 

27 51 AGCTTTAGCC TTAGGTTTTA GCTTCCTTCA CTCTTTGC AT AC ATTGTG AA 

23 01 TCTGCGGTTT TAGATGGACC ATAGTGGAAA AAGGCTTTCA TCAATAACTC 

2351 GTGGACTTCA TCAATGGTAG AAAAGANAAT ACATAGTATG GAAAACTAGA 

2901 TATTTGATAT ATTTGGTTCA AACTCTTATC CGGTGTTGAG GTG AT AT AC A 

29 51 CATGAAGACA TAACAATCGC ATAGCCGAGA AACTAGTATT CATTAACCTT 

3 001 TTTCTCTAAA GAGATTGTCC TATCAATCTA AATTTT AG AT GTTAAAAAAA 

3051 AAATggtaag gctaaacagg ccgctaggtt ggttttacga tgatgtaaaa 

3 101 agtagccatc ttaaaataac agtcgcccgc gagaccggcc aggccatccc 

3151 acgggccata ggctcgctca agttgTGCTT GGCAGAATTT AGTAACTTGG 

3 201 GGTTTTGTTA TCAACAATCA ATAGTTTAAG GCTTTACCTG CAAGAAATGA 

3 251 AG AGTTT AAG GGTTCTTTTT GGTATTCCCG ATTCACACAA GTGAGCTAGC 

33 01 TCATCAGAGT CCACGAGCTT CCCACTAAAA AATTGAAAAT TGTTG CTTCT 

* ELL 



3 3 51 GTCAT CTGAA ATT AAAC AAA GCGAGAAAAG GCGATACAAA CGATTTCGAA 

3 401 TGCTTCATCT TCTCCTTTGA AAATCCTTCT TCTGCTTAAT GCTGCTAGAT 

3451 ATGGATCTCG GTGTTCTTCT TCCATCATTG CAATCTGTGA GCTGTCTCTT 

3 501 TAGCTTTTGA CTGTTGCAAT TGTT ATTGTG AAATTTTTGT TCGCTTTTGG 

3 551 ATCAGCTTTT GTTAAATTCG TTCCGAGATT TTAGGTTTAT GTGCTGGTGT 

3 601 TTTACTTCGT TTACTTGGGN CGNTGGCGGA GAAATTCTCC CCGGGAAAGT 

37 01 ATGGTATATT TGATTTGATT TACTCTCTCT ACAA7TCCTG AGAGTCTGTG 

37 51 AGCTCGAAAG TTCATTTCCA TTAGTTTGGT TAATTCAATT TCAGGTCTAT 

3 301 TGGCACTAAT ATTGTTGGTA GCTATTTNGG GAATCTGTGC AAAACTTGGC 

3 351 ATTGTATCAC CTCTTGTAAG TGTAGTTAC A AGATTTCGAT TGTATTTCTA 

3 901 TGAATCCGAA TGCTATATGC TATATGAATC CGATTGCAAT TGCTTTCTCA 
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4251 CAATTGTAGA TCTCATACTT TCTTGTTACA CAAAATGTTG TTAAAAGTTA 

4301 TATATTTTGA CTCCTGCAAG AGCAAAACTA AGAAATAATC TGGTACTATA 

4351 TAGAGTTTGA AACACTGAAT TGGACAAGAT GATTCTATAG AACTTCGTAG 

4 401 AGTGTTGAGT AATTTCTCCT AGAACGGTTG TAGCTTCCTC TTTTTTCCTT 

4451 TTAACCGCAG TGACTTTAGC TTTTGGAACT TTTCTACTGA AACTAGAAGT 

4501 TCTGGTTTTG TCTTTCACTT ATCTCTTCCA AACAACTGCT TCAATTTTTT 

4 551 CTCATATTGT TTGTTTCATG TGATAGGTGG TTTGGAATAC AGCTGAATCC 

4601 TCAGTTTATG AGCATTGATC TCAAGTAATC CATTTTTCTG TTTTTTCTTC 

4651 T ATTTGTC AG CCAAGGCTAC ATCATTGCTT CAGTTTGTTC CGTACTCAAT 

47 01 CGAGTGGCAG TTTAATAATG TAATCAGCAG TTATGCATGG TTATGATGAA 

4751 TGGGAGTTAT TCCTTGTGTA GGTTTTTCTT TGTCAGAGCC GGGATGATGG 

43 01 GATGGCTGCT TATCAATCTC TCTATTCTGG CAAAAAGTGT GCAGGATGGT 

4 851 TCCTTGAGTC AGTCGATGAT CTTTACCAGA TCTTCTGTGC GGTAAATTTG 

4901 GTTTTTACTT ACAAATCTTG CTTCTTGAAN TCTGATCATC TGTGTTTTGT 

4951 TAGTTTTGAT TAGTTTTATA ATTGCAGTTA TATATATTGG ATACTTTGTT 

5001 CATGAAGAAT ACATGACCTC TACGTAAGTT CATGGCGTGT TAAGGAAACA 

5051 CATTTGTCTT ACCAAAAAAT GACCATTTGC ATTATTACAT CTACTTTGAT 

5101 TTTACTCTTT TCAGGTGGGA CATAATTGCA GAGAGACTAG GCTTCATGCT 

5151 AGTGTTTGGA GATCTCCTGT GGATTCCTTT CACTTTTAGC ATTCAGGCAT 

5201 GTAACTGTGA GCCTGAACAC AAACAAGATA TTAATTTATC TTATTGACAG 

5251 TATCTTCTTG GCATGTTACA GTTATTCTCG GAAACAATAT TGTTCTAGAA 

53 01 TGCTTGATCA CTCTGTGACT GAATTGTCTT CTCTCTGGTA CAGGGCTGGT 

53 51 GGCTTTTG C A CAACAAAGTA GAACTAACAA TTCCTGCGAT TGTAGTCAAT 

5401 TGCCTTGTCT TCTTGATAGG GTAAGTTCTG AGACATGGGG TTATTTTCCA 

5451 TTCTT AC ATA TCTACACTAA GAAACCCACT ATTTCTTCTT TGGCAGGTAC 

5501 ATGGTTTTTC GAGGAGCTAA CAAACAAAAA CATATCTTTA AGAAGAACCC 
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5801 CTTGCTCTGT 

5851 GAGTTCACTT 

59 01 AACGTTCGAA 
5951 ATTTGAAAAG 
6001 ATATTTCGTC 

60 51 TATACCTGCT 
6101 TGTGCAGAGA 
6151 CTGGAGAATA 
6201 AATCCTTTCA 
6251 NTG AAATG AT 
63 01 AGTACACGTT 
63 51 ATCAACATTT 
6401 CNAANAAAAC 
6451 AAAATNCT AN 
6501 ATGGANAAAA 
6551 TGGACNACTT 



C CTTC AGTTT GCCATGTGGA ATAAGGTACT CCTNCTGCTT 
ACAGCTACCA AAATCATGTA GAAACTAATA CCAATATCNA 
GTTGATTTGG CTGACTTAAA GATATTGATC TCTAACCATC 
TCTAAAGCTT TCAAGTTCAT TTCCCAAAGC TGTTTTTATG 
TNGTGTATTC TCAGTTCTCC GGTTCCATAT TTCTACCCGA 
GATACTATTG ATATGGAGAG AACGAAGAGA CGAAGTTCGA 
AGTACNAGGA G ATATGGGC A GAGTATCTTA GACTTGTCCC 
CTTCCTTATG TTTATTAGAT GTGCCAAGAG CCAATTCATG 
GATTCATCCT CTTGTGTCTT ATTTTTTCAT TAAATGTGAC 
CCCATTATNG CCTNTTATCA ATGCTTGATT GAAACTTTGT 



TGAGAATTAC TT? AGTCCTT ( GTTATTATTT TAGCATGGAT 
TCGGATTTAT TTNTNGGGTT ATTTTAAAAC CNNAGATTAC 
CATTGTTTGA NGTANGATAA TATGGACTTT TTACTGAAAA 
TAGGGGAACA AATNGAAGTT GAATATGGCT GAATNTTTTT 
TGGAAACTTT TCCCACTTTG AAATG AC AAT NCAAGTTTGG 
AATCACTGGA AACGTTAATG GCCAACCU 
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1 CTGAAATTAAACAAAGCGAGAAAAGGCGATACAAACGATTTCGAATGCTT 5 0 
I I I M I I I I I I I I I II I I I I M I I II I I I I I M I I I I M II I I II I I M I 
3 3 56 CTGAAATTAAACAAAGCGAGAAAAGGCGATACAAACGATTTCGAATGCTT 3 4 05 

m ■ • • • 

5 1 CATCTTCTCCTTTGAAAATCCTTCTTCTGCTTAATGCTGCTAGATATGG A 10 0 
I I I I I I I I M M I I I I I I I I I I M I I M t I I I I I M I I I I i I II I I I M I 
3 40 6 CATCTTCTCCTTTGAAAATCCTTCTTCTGCTTAATGCTGCTAGATATGGA 3 455 

• • • * « 

101 TCTCGGTGTTCTTCTTCC ATC ATTGCAATCT 131 

I I I I II I t I I I II I I I II I I I I I I II I I 1 I I 
3 4 56 TCTCGGTGTTCTTCTTCC ATCATTGCAATCTGTGAGCTGTCTCTTTAGCT 2 50 5 



132 GTTTATGTGCTGGTGTTTTAC 152 

I I I I I II II I II II 1 M I II I 
3 5 5 6 CTTTTGTTAAATTCGTTCCGAG ATTTTAGGTTTATGTGCTGGTGTTTTAC 3 60 5 

■ » * • « 

15 3 TTCGTTTACTT . G3CCGTTGCCGG AG AAATTCTCCCCGGGAAAGTTATTC 2 01 

I II I I I I I I II $1:11:11 I ! I I I I M I II I I I I M I I I I I I II I I I I 

3 60 6 TTC3TTTACTTGGGNCGNTGGCGG AG AAATTCTCCCCGGGAAAGTTATTC 3 6 55 

• • • • a 

2 02 GCGGCGTC CTTTT ATC AGATGGCTCTC AACTTCGTTACCGATGC AAT . . . 243 

II I I I I I II M I ! I I I I I I M I I I I I I I I II I I I I I I I I I M I I II I 



249 GGTCTATTGGCA 2 60 
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FIGURE 16 / SHEET 2 

I It I I I I I M I I 

37 56 G AAAGTTC ATTTCC ATTAGTTTGGTT AATTC AATTTC AGGTCTATTGGC A 3 805 

• * • • « 

261 CTAATATTGTTGGTAGCTATTTTGGG AATCTGTGC AAAACTTGGC ATTGT 3 10 
It I I I I I I 1 I I I I M 1 1 M I M : II I I I I I I I I I I 1 M II If II M 1 I I I 
3 80 6 CTAATATTGTTGGTAGCTATTTNGGG AATCTGTGC AAAACTTGGC ATTGT 3 355 

311 ATCACCTCTT 320 

I I I II M I I I 

3 8 5 6 ATCACCTCTTGTAAGTGTAGTTAC AAGATTTCGATTGTATTTCTATGAAT 3 9 0 5 



• • • * • 

321 GTGGTTGCGGATAGAgGACTTGAgTTAC 3 48 

I I f I I I II M II M M I I I I f I I! I M f 

3 9 56 ATTCCACTGAGATGTTTGGTAGGTGGTTGCGGATAGAGGACTTGAGTTAC 4 0 05 

• • » * . 

3 49 TCTCAGCTACTTTTATTTTCTGTGTTT 375 

I I I I I I I I I I II : : I M I Mil!!! 

4 0 06 TCTC AGCT ACTTNNATTTCTTGTGTTTGGGGAAG ATGATC AATCCTTAGT 4 0 55 



376 TGGTGACATTAgC 3 88 

II M I II II I M 

4106 GTGATTTGTAATCTCCATGATATCTCTTAATATTCTCAGGTGACATTAGC 4155 

• * » ■ * 

389 ATTGTATGTTACTGGGCG AAGTt CCTCg AAT AAggG t Tc t TCCCTAAAgC 43 8 
I I I N I I I I I I I I I I I II I I I I I I I II I I I I N I 1 I I M I I I I M I I I I I 
415 6 ATTGTATGTTACTGGGCG AAGTTCCTCGAATAAGGGTTCTTCCCTAAAGC 4 2 05 

m * • * * 

439 CTCATGTCTCAgGAAATCTTGTACATGACT 468 

1 1 1 N 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i 

4206 CTCATGTCTCAGGAAATCTTGTACATGACTGGTACTAACATAATACAATT 4255 



469 GGTGGTTTGGAATACAGCTGAATCCTCAGT 498 

M I I M I I II I I I I 1 I I I I I I I I I M I II I 
45 56 ATTGTTTGTTTCATGTGATAGGTGGTTTGGAATACAGCTGAATCCTCAGT 4 605 

• * . • 

499 TTATGAGCATTGATCTCAA 517 

I I I I I I I I M I I M M I I I 
4 60 6 TTATGAGCATTGATCTCAAGTAATCCATTTTTCTGTTTTTTCTTCTATTT 4 655 



518 GTTTTTCTTTGTCAGAGCCGGGATGATGGGATGG 551 

I I I I I II I I I I I I I II I I M II I M I I I I I I I I I 
4 756 GTTATTCCTTGTGTAGGTTTTTCTTTGTCAGAGCCGGGATGATGGGATGG 4 80 5 

■ • • * • 

5 52 CTGCTTATCAATCTCTCTATTCTGGCAAAAAGTGTGCAGGATGGTTCCTT 601 
I I I M I I M I M I I I I I I I I I I I I I I I I M I I I | I I II I II I I | I I I I 11 
4 80 6 CTGCTTATC^TCTCTCTATTCTGGCAAAAAGTGTGCAGGATGGTTCCTT 4S55 

* • • • a 

602 GAGTCAGTCGATGATTCTTTACC AGATCTTCTGTGC 637 

I M M I II I I M I I M I t I II I I! I I I I I M I I I I 
4 356 GAGTCAGTCGATGA . TCTTTACCAGATCTTCTGTGCGGTAAATTTGGTTT 4904 
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633 GTTATATATATTGCACTACTTTGTTCAT 66 5 

I I I I I I I I I I I I I 1 I II I II I I II I I I 
49 55 TTTGATTAGTTTTATAATTGCAGTTATATATATTGGA . TACTTTGTTC AT 5 00 3 

• * • • . 
666 GAAgAATACATGACCTCTAC 58 5 

I I I I I I I M I I I I 1 II I I I I 
5004 GAAGAATACATGACCTCTACGTAAGTTCATGGCGTGTTAAGGAAACACAT 50 53 



686 GTGGG AC ATAATTGC AGAGAGACTAgGCTTCATGCTAGT 724 

H II M M If II I M M 11 t M I M M M M U ) M I I I 
5104 ACTCTTTTCAGGTGGGAC ATAATTGCAGAGAGACTAGGCTTCATGCTAGT 5153 

... 

725 GTTTGGAGATCTCCTGTGGATTCCTTTCACTTTTAGCATT 764 

I M 1 I II I I I I M I I I I I f M II I I I I II I M I I I I M I I 
5154 GTTTGGAGATCTCCTGTGGATTCCTTTCACTTTTAGCATTCAGGCATGTA 52 03 



765 CAGGGCTGGTCCC 777 

M I II I I I I I I II 

53 04 TTGATC ACTCTGTG ACTG AATTGTCTTCTCTCTGGTACAGGGCTGGTGGC 53 53 

* * * * i 

77 8 TTTTGC AC AAC AAAGT Ag AACT AAC AGTTC CTGCG ATTGT AGTC AATTGC 827 
I I I I 1 1 I 1 II I I I 1 I i I I I I I I I I I I I II M I I I I I II I I I II i M II I 

53 54 TTTTGC ACAACAAAGTAGAACTAACAATTCCTGCGATTGTAGTC AATTGC 540 3 

« • • • 9 

828 CTTGTCTTCTTGATAG 843 

I I I I I I I I I I I I 1 I 1 I 
5404 CTTGTCTTCTTGATAGGGTAAGTTCTGAGACATGGGGTTATTTTCCATTC 54 5 3 

844 GGTACATG 351 

II I 1 I II I 

54 54 TTACATATCTACACTAAGAAACCCACTATTTCTTCTTTGGCAGGTACATG 5 50 3 

352 GTTTTTCGAgGAgCTAACAAACAAAAACATATCTTTAAGAAGAACCCAAA 901 
I I II II I I I M M I I I I I I II I I II II M I I I I M I I I I M I I I I I I I I I 
550 4 GTTTTTCG AGG AGCTAACAAACAAAAACATATCTTTAAGAAG AAC CC AAA 55 5 3 

* • - » . 

902 AACACCAATATGGGGCAAGCCTCC AGTGGTAGTTGGTGGAAAGTTACTGG 9 51 
I I I I I I I M I 1 I I I I I I M I I I II I I I I I I I I I I M I I I I I I I I I I M I I 
5554 AACACCAATATGGGGCAAGCCTCC AGTGGTAGTTGGTGGAAAGTTACTGG 550 3 

* * * ♦ « 

952 TTTC AGGCT ATT 963 

MINIMUM 

5 60 4 TTTC AGGCT ATTGGTATGTTATATTTATCTTCTCTTGTTTCTTTGCTTGG 56 5 3 



9 64 GGGG AATTGC AAGGCACTGTAATTACCTTGGCGACTTGATGCTT 1007 

M I I M I I I I II I I I II M I I I I M II I I II II I M I M M M I 
57 54 ATG AC AGGGG AATTGC AAGGCACTGTAATTACCTT3GCGACTTGATGCTT 53 03 

. • * . - . 
1008 GCTCTGTCCTTCAGTTTGCCATGTGG AATA 1037 
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IIIIIIIIIMIilllllillllliMIII x 
5804 GCTCTGTCCTTC AGTTTGCC ATGTGG AATAAGGTACTCCTNCTGCTTGAG 58 53 



1038 AGTTCTCCGGTTCCATATTTCTACCCGATAT 1068 

1 1 1 i 1 i 1 1 1 1 1 1 1 1 I M M 1 1 I it 1 1 it M I 

6 0 04 TTTCGTCTNGTGTATTCTCAGTTCTCCGGTTCCATATTTCTACCCGATAT 6053 

A • • • * 

1069 ACCTTCTGATACTATTGATATGGAGAGAACGAAGAGACGAGGTTCGATGT 1118 

MM II I I I M II I i II M M M M M M I I M II I I II M I M I M I 
6054 ACCTGCTGATACTATTGATATGGAGAGAACGAAGAGACGAAGTTCGATGT 6103 

» • k » • 

1119 GCAGAGAAGTACAAGGAGATATGGGCAGAGTATCTTAGACTTGTCCCCTG 1168 

t | I t t I I I I 1 I t : I I I I II II M I I M II I I I I M I t I I II I I II II It t 
6104 GCAGAGAAGTACNAGGAGATATGGGCAGAGTATCTTAGACTTGTCCCCTG 6153 

• • * • • 

1169 G AGAATACTTCCTTATGTTTATTAGATGTGCC AAGAGCCAAGTCATG AAt 1218 

M I I II I I I M I I II 1 t t t M M I M I I I I I M M M \ I M I M 1 M I I 
6154 G AG AAT ACTTC CTT ATG TTT ATT AG ATGTGCC AAG AGCC AATTC ATG AAT 6203 

• • * ■ • 

1219 C CTTTC AG ATTC AC CTCTTGTTGTCTTATTTTTTCC AT AA 1258 

I M I M M M I II I i II I I I M I M 1 I M Ml 
620 4 CCTTTCAGATTCATCCTCTTGTCTCTTATTTTTTCATTAAATGTGACNTG 6253 

• * * • • 

1259 TCTTGTTTTATTTTAGCAATGCTCGAATTGAAACTTTGTAG 1299 

M I : I M II 1 I I I II I I I I M II II I I I II I I 
6254 AAATGATCCCATTATNGCCTNTTATCAATGCTTG . ATTG AAACTTTGT AG 63 02 

1300 TACACTTTTGAAAAATAACTTCAGTCCTT 13 28 

Mill MM I Ml M I M I II M M 
63 03 TAG ACGTTTG . AGAATTACTTCAGTCCTT 63 30 
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PT ANT STEROL REDUCTASES AND USES THEREOF 

5 

Backgro und of the Invention 
This application relates to plant sterol biosynthetic enzymes, genes, and their 

uses. 

Plant sterols belong to a large group of secondary compounds known as 

10 terpenes or isoprenoids. Sterol biosynthesis in plants generally involves a series of 
different enzymatic steps in the isoprenoid pathway that result in the formation of a 
variety of sterol end products (Benveniste Ann. Rev. Biochem. 37:275, 1986). 
Although such sterol compounds have been identified in higher plants, their function 
in plant growth and development is poorly understood. 

15 One such plant sterol, brassinolide, that belongs to a class of sterols referred to 

as brassinosterioids (BR), was first discovered in the pollen of Brassica napus (Grove 
et. al., Nature 281: 216, 1979). Brassinosteroids are growth-promoting natural 
products having structural similarities to animal steroid hormones. The wide 
distribution of brassinosteroids in the plant kingdom, their effect on cell proliferation 

20 and elongation, and their interactions with other plant hormones (e.g., cytokinins), 

have indicated that these compounds are plant-growth regulators. Brassinosteroids are 
thought to promote hypocotyl elongation, leaf unrolling, and xylem differentiation. In 
addition, such compounds are also believed to be involved in de-etiolation of 
cotyledons, root elongation, radial growth, and anthocyanin formation. 

25 The function of plant sterol growth regulators, such as BR, in relationship to 

other classes of plant growth regulators such as auxin, gibberellin, abscisic acid, and 
cytokinin, during plant development also needs to be evaluated. For example, the 
growth regulator, cytokinin, is known to affect a variety of developmental processes 
including photomorphogenesis, chloroplast biogenesis and maintenance, apical 



BNSDOCID: <WO 9748793A1_IA> 



WO 97/48793 PCT/US97/10644 



-2- 



dominance, and senescence. In addition, this growth regulator is thought to 
antagonize BR's ability to promote hypocotyl elongation and cotyledon de-etiolation 



Summary of the Invention 
5 In general, the invention features a substantially pure plant C-14 sterol 

reductase polypeptide. Preferably, the C-14 sterol reductase polypeptide includes an 
amino acid sequence substantially identical to the sequence shown in Fig. 14 (SEQ ID 
NO: 1); and is from a dicot (for example, a crucifer or a solanaceous plant), monocot, 
gymnosperm, or an alga. 

10 In related aspects, the invention features purified DNA that includes a sequence 

encoding a C-14 sterol reductase polypeptide (for example, a sequence substantially 
identical to the DNA sequence shown in Fig. 14; SEQ ID NO: 2; or a DNA sequence 
that encodes a C-14 sterol reductase polypeptide which has an amino acid sequence 
substantially identical to that shown in Fig. 14; SEQ ID NO: 1). The invention also 

15 features a vector and a cell, each of which includes purified DNA encoding a C-14 
sterol reductase polypeptide; and a method of producing a recombinant C-14 sterol 
reductase polypeptide involving providing a cell (for example, a plant cell) 
transformed with purified DNA encoding a C-14 sterol reductase polypeptide 
positioned for expression in the cell, culturing the transformed cell under conditions 

20 for expressing the DNA, and isolating the recombinant C-14 sterol reductase 
polypeptide. The invention further features recombinant C-14 sterol reductase 
produced by such expression of a purified DNA, and an isolated antibody that 
specifically recognizes and binds a plant C-14 sterol reductase polypeptide. 

In addition, the invention features nucleotide sequences that hybridize to a C-14 

25 sterol reductase gene (including the coding sequence of such a gene and its 

complement) and that encode a C-14 sterol reductase polypeptide. Furthermore, the 
invention includes oligonucleotide probes that detect a C-14 sterol reductase gene or 
functional equivalents thereof in a plant (for example, dicots (such as solanaceous and 
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cruciferous plants), monocots, gymnosperms, and algae). Such probes are useful to 
isolate DNA sequences that encode C-14 sterol reductases from other plants. In one 
particular example, oligonucleotides may be designed based on a C-14 sterol reductase 
sequence disclosed herein and used as hybridization probes or as primers in 
5 polymerase chain reactions (PCR). Conserved regions in the C-14 sterol reductase 
gene are useful in the design of such primers to facilitate the recovery of C-14 sterol 
reductases from other related and unrelated plants. 

In yet other related aspects, the invention features a transgenic plant (or seeds 
or cells thereof) containing DNA encoding a C-14 sterol reductase polypeptide 

10 integrated into the genome of the plant, where the DNA is expressed in the transgenic 
plant, resulting in the production of a C-14 sterol reductase polypeptide. 

In still another aspect, the invention features a method for reducing the level of 
a plant C-14 sterol reductase polypeptide in a transgenic plant cell. This method 
generally involves expressing in the transgenic plant cell an antisense C-14 sterol 

15 reductase polypeptide nucleic acid sequence. In general, such an antisense C-14 sterol 
reductase nucleic acid sequence is encoded by a transgene integrated into the genome 
of the transgenic plant cell and is based on the nucleotide sequence that is shown in 
Fig. 14 (SEQ ID NO: 2) or Fig. 15. (SEQ ID NO: 3). In preferred embodiments, the 
plant cell expressing an antisense C-14 sterol reductase nucleic acid sequence is a 

20 dicot (for example, crucifer), monocot, gymnosperm, or algal cell. In yet other 
preferred embodiments, the method involves growing a transgenic plant from the 
transgenic plant cell, whereby the level of the C-14 sterol reductase polypeptide is 
reduced in the transgenic plant. 

In other related aspects, the invention features a plant cell expressing an 

25 antisense C-14 sterol reductase nucleic acid sequence and a plant expression vector 
that includes an antisense C-14 sterol reductase nucleic acid sequence, where the 
antisense sequence is operably linked to an expression control region. 
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In another aspect, the invention features a method for increasing the level of a 
C-14 sterol reductase in a transgenic plant cell. This method involves expressing in 
the transgenic plant cell a C-14 sterol reductase polypeptide nucleic acid sequence. 
Preferably, the method utilizes a C-14 sterol reductase nucleic acid sequence that is 
5 substantially identical to the nucleotide sequence that is shown Fig. 14 (SEQ ID NO: 
2). In preferred embodiments, the plant cell expressing a C-14 sterol reductase 
polypeptide nucleic acid sequence is a dicot (for example, a crucifer), monocot, 
gymnosperm, or algal cell. 

In another aspect, the invention features a transgenic plant having a knockout 
10 mutation in DNA encoding a plant C-14 sterol reductase polypeptide. Such knockout 
genes are constructed according to conventional methods 
(e.g., Lee et al. Plant Cell 2: 415, 1990; Miao and Lam, Plant J. 7: 359, 1995). 

By "plant C-14 sterol reductase" is meant an amino acid sequence that 
catalyzes the reduction of any sterol precursor having a C14=C1 5 double bond, for 
15 example, as described by Benveniste, Annu. Rev. Biochem. 37: 275, 1986. Preferably, 
such a polypeptide has an amino acid sequence which is at least 30%, preferably 40%, 
and most preferably 50% or even 80-95% identical to the amino acid sequence of the 
C-14 sterol reductase polypeptide shown in Fig. 14 (SEQ ID NO: 1). The length of 
comparison of amino acid sequences will generally be at least 16 amino acids, 
20 preferably at least 20 amino acids, more preferably at least 25 amino acids, and most 
preferably at least 35 amino acids. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of 
length or post-translational modification (for example, glycosylation or 
phosphory 1 ati on) . 

25 By a "substantially identical" polypeptide sequence is meant an amino acid 

sequence that differs only by conservative amino acid substitutions, for example, 
substitution of one amino acid for another of the same class (e.g., valine for glycine, 
arginine for lysine, etc.) or by one or more non-conservative substitutions, deletions, 
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or insertions, located at positions of the amino acid sequence that do not destroy the 
function of the polypeptide (assayed, for example, as described herein). 

Sequence identity is typically measured using sequence analysis software (for 
example, Sequence Analysis Software Package of the Genetics Computer Group 
5 (University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705), BLAST, or PILEUP/PRETTYBOX programs). Such software matches 
sequences by assigning degrees of homology to various substitutions, deletions, and 
other modifications. 

By "substantially pure polypeptide" is meant a polypeptide preparation that is at 

10 least 60% by weight (dry weight) the compound of interest, for example, the C-14 
sterol reductase polypeptide or C-14 sterol reductase-specific antibody. Preferably, 
the preparation is at least 75%, more preferably at least 90%, and most preferably at 
least 99%, by weight the compound of interest. Purity can be measured by any 
appropriate method, for example, column chromatography, polyacrylamide gel 

15 electrophoresis, or HPLC analysis. 

By "purified DNA" is meant DNA that is not immediately contiguous with both 
of the coding sequences with which it is immediately contiguous (one on the 5' end 
and one on the 3 1 end) in the naturally-occurring genome of the organism from which 
it is derived. The term therefore includes, for example, a recombinant DNA that is 

20 incorporated into a vector; into an autonomously replicating plasmid or virus; or into 
the genomic DNA of a prokaryote or eukaryote, or that exists as a separate molecule 
(for example, a cDNA or a genomic DNA fragment produced by PCR or restriction 
endonuclease treatment) independent of other sequences. It also includes a 
recombinant DNA that is part of a hybrid gene encoding one or more additional amino 

25 acids. 

By a "substantially identical" nucleic acid is meant a nucleic acid sequence that 
encodes a polypeptide differing only by conservative amino acid substitutions, for 
example, substitution of one amino acid for another of the same class (e.g., valine for 
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glycine, arginine for lysine, etc.) or by one or more non-conservative substitutions, 
deletions, or insertions, located at positions of the amino acid sequence that do not 
destroy the function of the polypeptide (assayed, for example, as described herein). 
Again, the encoded sequence is at least 30%, more preferably 40%, and most 
5 preferably 50%, or even 80 to 95% identical at the amino acid level to the sequence of 
Fig. 14 (SEQ ID NO: 1). Thus, when nucleic acid sequences are compared, a 
"substantially identical" nucleic acid sequence is one which is at least 30%, more 
preferably 40%, and most preferably 50%, or even 80 to 95% identical to the sequence 
of Fig. 14 (SEQ ID NO: 2). The length of nucleic acid sequence comparison will 

10 generally be at least 30 nucleotides, preferably at least 60 nucleotides, more preferably 
at least 75 nucleotides, and most preferably 1 10 nucleotides. Again, identity is 
typically measured using sequence analysis software (for example, Sequence Analysis 
Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Wl 53705). 

15 By "isolated antibody" is meant antibody that is at least 60%, by weight, free 

from the proteins and naturally-occurring organic molecules with which it is naturally 
associated. Preferably, the preparation is at least 75%, more preferably at least 90%, 
and most preferably at least 99%, by weight, antibody. 

By "specifically binds" is meant an antibody that recognizes and binds a C-14 

20 sterol reductase polypeptide but which does not substantially recognize and bind other 
molecules in a sample (e.g., a biological sample) which naturally includes a C-14 
sterol reductase. An antibody which "specifically binds" a C-14 sterol reductase is 
sufficient to detect a C-14 sterol reductase product in such a biological sample using 
one or more of the standard immunological techniques available to those in the art (for 

25 example, Western blotting or immunoprecipitation). 

By "an antisense C-14 sterol reductase sequence" is meant a nucleotide 
sequence that is complementary to a plant C-14 sterol reductase messenger RNA. In 
general, such an antisense sequence will usually be at least 15 nucleotides, preferably 
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about 15-200 nucleotides, and more preferably 200-2,000 nucleotides in length. The 
antisense sequence may be complementary to all or a portion of the plant C-14 sterol 
reductase mRNA nucleotide sequence, and, as appreciated by those skilled in the art, 
the particular site or sites to which the antisense sequence binds as well as the length 
5 of the antisense sequence will vary, depending upon the degree of inhibition desired 
and the uniqueness of the antisense sequence. By binding to the appropriate target 
sequence, an RNA-RNA, DNA-DNA, or RNA-DNA duplex is formed. A 
transcriptional construct expressing a plant C-14 sterol reductase antisense nucleotide 
sequence includes, in the direction of transcription, a promoter, the sequence coding 

10 for the antisense RNA on the sense strand, and a transcriptional termination region. 
Antisense C- 1 4 sterol reductase sequences may be constructed and expressed as 
described herein or as described, for example, in van der Krol et al., Gene 72: 45, 
1988; Rodermel et al., Cell 55: 673, 1988; Mol et al, FEBS Lett 268: 427, 1990; 
Weigel and Nilsson, Nature 377: 495, 1995; Cheung et al., Cell 82, 383, 1995; and 

15 U.S. Pat. No. 5,107,065. In addition, C-14 sterol reductase antisense sequences are 
useful for the formation of triple helices, where the antisense sequence is bound to a 
DNA duplex. By binding to the target nucleic acid, C-14 sterol reductase antisense 
sequences can inhibit the function of the target nucleic acid. This results, for example, 
in the blocking of transcription, processing of poly A+ addition, replication, 

20 translation, or promoting inhibitory mechanisms of the cell, such as RNA degradation. 
The triple helix-forming and antisense C-14 sterol reductase sequences are useful for 
selectively suppressing certain cellular functions that are associated with C-14 sterol 
reductase activity. 

By a "transformed cell M is meant a cell into which (or into an ancestor of 
25 which) has been introduced, by means of recombinant DNA techniques, a DNA 

molecule encoding (as used herein) a C-14 sterol reductase polypeptide (for example, 
a substantially identical DNA encoding the C-14 sterol reductase shown in Fig. 14 
(SEQ ID NO: 2)). 
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By "positioned for expression" is meant that the DNA molecule is positioned 
adjacent to a DNA sequence which directs transcription and translation of the 
sequence (for example, facilitates the production of, for example, a plant C-14 sterol 
reductase polypeptide such as the amino acid sequence shown in Fig. 14 (SEQ ID NO: 
5 1)), or an RNA molecule (for example, an antisense RNA). 

By "promoter" is meant a minimal sequence sufficient to direct transcription. 
Included in the invention are promoter elements that are sufficient to render promoter- 
dependent gene expression controllable for cell-, tissue-, or organ-specific gene 
expression, or elements that are inducible by external signals or agents (for example, 
10 light-, pathogen-, wound-, stress-, or hormone-inducible elements); such elements may 
be located in the 5' or 3* regions of the native gene or engineered into a transgene 
construct. 

By "operably linked" is meant that a gene and a regulatory sequence(s) are 
connected in such a way as to permit gene expression when the appropriate molecules 
15 (for example, transcriptional activator proteins) are bound to the regulatory 
sequence(s). 

By "crucifer" is meant any plant that is classified within the Cruciferae family 
as commonly described in, e.g., Gray's Manual of Botany American Book Company, 
N.Y., 1950; Hortus Third: A Concise Dictionary of Plants Cultivated in the U.S. and 
20 Canada, Macmillan, 1976; or Simmons, N.W., Evolution of Crop Plants, 1986. The 
Cruciferae include many agricultural crops, including, but not limited to, broccoli, 
cabbage, brussel sprouts, rapeseed, kale, Chinese kale, cauliflower, horseradish, and 
Arabidopsis. 

By "plant cell" is meant any self-propagating cell bounded by a semi-permeable 
25 membrane and containing a plastid. Such a cell also requires a cell wall if further 

propagation is desired. Plant cell, as used herein includes, without limitation, algae, 
cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, callus 
tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 
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By ,! transgene n is meant any piece of DNA that is inserted by artifice into a 
cell, and becomes part of the genome of the organism which develops from that cell. 
Such a transgene may include a gene which is partly or entirely heterologous (i.e., 
foreign) to the transgenic organism, or may represent a gene homologous to an 
5 endogenous gene of the organism. 

By "transgenic" is meant any cell that includes a DNA sequence which is 
inserted by artifice into a cell and becomes part of the genome of the organism which 
develops from that cell. As used herein, the transgenic organisms are generally 
transgenic plants and the DNA (transgene) is inserted by artifice into the nuclear or 
1 0 plastidic genomes. 

Other features and advantages of the invention will be apparent from the 
following detailed description thereof, and from the claims. 



15 Detailed Description 

The drawings will first be described. 
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Drawings 

Figs. 1A-1B are photographs showing that the ell mutant morphology was 
phenocopied by treating wild-type seedlings with 30 juM of dimethylallylamino-purine 
(2ip), a synthetic cytokinin, in the dark (Fig. 1A) or light (Fig. IB). From left to right 
in both Figs. 1A-1B: wild-type plant; wild-type plant + 2ip; ell; and ell + 2ip. 

Fig. 2 is a photograph illustrating the constitutive photomorphogenesis of ell 
seedling development in the dark. Wild-type (left) and ell (right) seedlings were 
grown in the dark for twenty-one days on Murashige-Skoog (MS) plates containing 
two percent sucrose. 

Figs. 3A-3B are photographs showing that the rosette leaves of the ell plant 
(Fig. 3B) are darker green in color that those of the wild-type plant (Fig. 3A). 

Fig. 4 is a photograph illustrating that an ell mutant has reduced apical 
dominance in comparison to a wild-type plant. Six-week-old wild-type (left) and ten- 
week-old ell (right) plants were grown in the greenhouse. 

Fig. 5 is a photograph showing that ell mutants (right) exhibit irregular, 
thickened cotyledons and hypocotyls, and reduced cotyledon petioles compared to 
wild-type plants (left). 

Figs. 6A-6B are photographs showing abnormal flower development in the ell 
mutant. Fig. 6A shows, from left to right, that the sepal, petal, stamen, and carpel are 
shorter in ell (lower row) than wild-type (upper row) plants. Fig. 6B shows, from left 
to right, the top and side view of wild-type (left) and ell (right) flowers. 

Figs. 7A-7F are photographs showing embryo development in ell and wild- 
type plants. Wild-type and ell plants are shown in the left and right of each 
photograph, respectively. Fig. 7A shows ell embryo development at the 32- to 64-cell 
stage, and Fig. 7B shows that, when wild-type embryos have reached the heart stage, 
ell embryos are only at the globular stage. As shown in Fig. 7C and Fig. 7D, when 
the wild-type embryo reached the torpedo stage, the ell mutant embryo was at the 
heart stage. Fig. 7E shows that apical hooks were not formed in ell embryos. And 
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Fig. 7F shows that ell seeds desiccated without completing the late stages of 
embryogenesis. 

Figs. 8A-8B are photographs of dry seeds from wild-type (Fig. 8A) and ell 
plants (Fig. 8B). Reduced seed size, wrinkled seed coat, and precocious germination 
5 were observed in ell seeds. 

Figs. 9A-9F are photographs showing the supernumerary cotyledons that were 
observed in the ell mutant, including one (Fig. 9 A), two (Fig. 9B), three (Fig. 9C), 
four (Fig. 9D), five (Fig. 9E), and more than six cotyledons (Fig. 9F). 

Figs. 10A-10B are illustrations showing various aspects of the molecular 
10 characterization of the Arabidopsis thaliana C-14 sterol reductase gene. Fig. 10A is a 
schematic illustration showing the position of a T-DNA insertion into chromosome 3 
of Arabidopsis , approximately forty base pairs upstream of the ELL gene, and the 
exon-intron structure of the C-14 sterol reductase gene. 

Fig. 10B is a schematic illustration showing the map position of ELL on chromosome 
15 3. 

Fig. 11 is a schematic illustration showing a comparison of the predicated ELL 
amino acid sequence (designated Ath; SEQ ID NO: 1 ) with C-14 sterol reductase of 
Saccharomyces cerevisiae (Erg24) and Schizosaccharomyces pombe (Pombe), and C- 
24 sterol reductase of Sz. pombe (Stsl) and S. cerevisiae (Ygl022). 
20 Fig. 12 is a schematic illustration showing that the predicted ELL amino acid 

sequence (designated Ath; SEQ ID NO: 1 ) shares homology to human and chicken 
lamin B receptor. 

Figs. 13A-13B are photographs showing that the ell phenotype was not 
corrected by exogenous feeding of brassinolide (1 /jM) in either dark (Fig. 13 A) or 
25 light (Fig. 13B). From left to right in Figs. 13A-13B: wild-type; ell; wild-type + 
brassinolide; and ell + brassinolide. 
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Fig- 14 is a schematic illustration showing the nucleotide sequence of an 
Arabidopsis C-14 sterol reductase (SEQ ID NO: 2) and its deduced amino acid 
sequence polypeptide (SEQ ID NO: 1). 

Fig. 15 is a schematic illustration showing the genomic nucleotide sequence of 
an Arabidopsis C-14 sterol reductase polypeptide (SEQ ID NO: 3). 

Fig. 16 is a schematic illustration showing the sequence comparison between 
the genomic nucleotide sequence (SEQ ID NO: 3) and cDNA sequences (SEQ ID NO: 
2) of an Arabidopsis C-14 sterol reductase. 



10 There now follows a description of an Arabidopsis mutant, ell (extra long life), 

that displays a life span that is at least three times greater than wild-type plants. The 
ell mutant was isolated by T-DNA tagging methods and was shown to encode a novel 
C-14 sterol reductase. This example is provided for the purpose of illustrating the 
invention, and should not be construed as limiting. 

15 

Identification and Developmental Effects of the ell MutatlPP By screening for 
mutants displaying BR deficiency or constitutive cytokinin activity, a recessive 
mutation causing pleiotropic developmental effects was identified according to 
conventional methods in an Arabidopsis T-DNA insertional mutant collection 

20 (Feldmann, Plant J. 1:71, 1991; Errampalli et al., Plant Cell 3: 149, 1991). This 

mutant, termed "ell", was found to have a number of developmental abnormalities. 

For example, unlike wild-type plants, ell mutants displayed constitutive light- 
morphogenesis (Fig. 2), similar to the Arabidopsis det2 (Chory et al., Plant Cell 3: 
445, 1991) and cpd (Szekeres et al., Cell 85: 171, 1996) mutants. In addition, 

25 compared to wild-type plants, ell plants had darker green rosette leaves (Figs. 3A-3B), 
reduced apical dominance (Fig. 4), stunted hairy roots, and irregular hypocotyl and 
cotyledons (Fig. 5). Furthermore, as shown in Figs. 6A-6B, the ell mutant showed 
reduced and ruffled sepals and petals. The ell mutant also showed delayed and altered 
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embryo development (Figs. 7A-7F) and was found to have reduced fertility, producing 
wrinkled seeds that precociously germinated (Figs. 8A-8B). In addition, the various 
phenotypes of ell overlapped with amp- J (ptJ) (Chaudhury et aL, Plant J. 4: 907, 
1993) and hauptling (Jtirgens et a!., Ann. Rev. Genet. 28: 351, 1994), including 
5 supernumerary cotyledons (Figs. 9A-9F). 

Finally, as shown in Figs. 1A-1B, the morphology of the T-DNA tagged ell 
mutant seedlings was phenocopied by treating wild-type seedlings with 30 \iM 
dimethylallylamine purine (2ip), a synthetic cytokinin. 

Despite having a number of developmental abnormalities, ell mutants were 

10 found to have a life span that was at least three times greater than wild-type plants. 
Genetic Analysis and Molecular Cloning of ELL 

Standard segregation analysis indicated that ell is a recessive mutation. The T2 
population of the transgenic line carrying the ell mutant showed a 3: 1 Mendelian 
segregation of the T-DNA using kanamycin resistance (kan r ) as a selectable marker. 

15 Of the kan r plants, thirty-three percent showed the ell phenotype, indicating that the ell 
mutation was recessive. A T3 population was then generated from selfed T2 kan r 
plants having the wild-type phenotype, and the kan 1 marker showed a 3:1 segregation. 
Of the seventy-five percent displaying kan r , twenty-five percent showed the ell 
phenotype. Because ell homozygous plants were found to be either lethal or sterile, 

20 T2 heterozygous ell plants were subsequently backcrossed to wild-type plants for 
additional segregation analysis. The resulting Fl population from this backcross 
showed a 1 : 1 segregation of the kan r marker; no plants were observed having the ell 
phenotype. The Fl kan r individuals of the backcross were then selfed to produce an 
F2 population. Seventy-five percent of this F2 population was found to be kan r , and 

25 thirty-three percent of the kan r resistant plants showed the ell phenotype, confirming 
the recessive nature of this mutant. Consistent segregation of the ell phenotype and 
kan r marker was also observed in a subsequent backcross, further indicating that ell 
was tagged by the T-DNA. 
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Genomic DNA blot analysis, using an NPTII probe derived from the T-DNA 
vector, showed that a unique single copy of T-DNA was integrated into the ell 
genome. This result, together with the segregation data described above, further 
indicated that the ell phenotype was associated with the kan r marker, and that the ell 
5 mutation resulted from a single T-DNA insertion in the Arabidopsis genome. 

The T-DNA-tagged locus was then isolated by constructing a genomic DNA 
library from the ell mutant and was mapped by hybridization using the NPTII probe. 
Fig. 10A shows the physical map of the T-DNA tagged locus that was determined by 
DNA hybridization. One of three genomic clones that were found to hybridize to the 

10 NPTII probe was partially sequenced and found to have a complete T-DNA insertion 
and flanking plant sequences. A segment of this genomic clone containing both T- 
DN A and plant sequences was then used to screen a genomic library that was prepared 
from wild-type plants. Two positive clones that were identified in this screen were 
then sequenced. The genomic nucleotide sequence is presented in Fig. 15 (SEQ ID 

15 NO: 3). 

The T-DNA-plant DNA insert junctions were also used as probes to screen a 
cDNA library that was prepared from wild-type plants. One isolated cDNA clone, 
designated D13, was found to have a nucleotide sequence (SEQ ID NO: 1) that 
matched the genomic sequences flanking the right T-DNA border. Comparison of the 

20 cDNA (Fig. 14) with the genomic DNA sequence (Fig. 1 5) also revealed that the T- 
DNA was inserted at a location forty base pairs upstream of the 5' end of the ELL 
cDNA transcript (Fig. 16). The complete genomic fragment covering the cDNA 
sequence was composed of 14 exons and 13 introns (Fig. 10A). Probes that were 
prepared from both the cDNA or genomic clone were then used for DNA blot 

25 analysis. Results from this analysis confirmed that the ELL gene was of plant origin. 

We also determined the chromosomal position of ELL by standard segregation 
analysis of restriction fragment length polymorphisms (RFLPs) in recombinant inbred 
lines (Nam et al., Plant Cell 1: 699, 1989; Lister and Dean, Plant J. 4: 745, 1993; 
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Hauge et al., Plant J. 3: 745, 1993; Schmidt et al., Science 270: 480, 1995; Zachgo et 
al., Genomic Res. 6: 19, 1996). By this analysis, we found that ELL is located on 
chromosome 3 and is flanked by the chromosomal markers by mi456 and g2778 (Fig. 
10B). 



ELL Encodes a Novel C-14 Sterol Reductase 

A comparison of the deduced polypeptide sequence of the full-length ELL 
cDNA clone to the GenBank database showed that ELL had 35% identity to C-14 
sterol reductase (Erg24) in yeast (Lorenz and Parks, DNA Cell Biol 9: 685, 1992; Lai 

10 et al., Gene 140: 41, 1994) (Fig. 1 1) and 40% identity to the lamin B receptor (LBR) 
in humans (Ye and Worman, J. Biol Chem. 269: 1 1306, 1994) (Fig. 12). In addition, 
the amino acid sequence of ELL predicted several hydrophobic regions and between 
eight to nine transmembrane domains, consistent with the yeast Erg24 and human 
LBR. However, ELL was observed to lack a basic nucleoplasmic amino-terminal 

15 domain of about 200 amino acids that has been identified in human LBR. Database 

searches also revealed that at least two Arabidopsis expression sequence tagged (EST) 
clones (GenBank accession numbers T4501 1 and T42407) shared homology to ELL. 
DNA sequencing revealed that T4501 1 encodes an unknown gene with 60% 
nucleotide sequence identity to ELL. The predicted amino acid sequence of T4501 1 

20 was also observed to have greater than 50% identity to the yeast ERG24 and human 

LBR. These results further confirmed that ELL is encoded by a gene that is a member 
of the C-14 sterol reductase gene family. T42407 was found to encode an Arabidopsis 
sterol A7-reductase (Lecain et al., J. Biol Chem. 271 : 10866, 1996) that shares 32% 
amino acid identity to ELL. 

25 RNA blot analysis indicated multiple transcripts hybridizing to the full-length 

ELL cDNA. 

To determine whether the ell mutant phenotype is corrected by exogenous 
feeding of brassinolide, we germinated ell seedlings on agar plates containing 1 (iM 



BNSDOCID: <WO 9748793A1 IA> 



WO 97/48793 




PCT/US97/10644 



-16- 

brassinolide or 1 ^iM 24-epibrassinolide (Li et ah, Science 272: 398, 1996). The 
results of these experiments showed that the presence of brassinolide or 24- 
epibrassinolide, in the growth medium of ell plants did not alter the mutant phenotype 
(Fig. 13A-13B). Thus, it appears that steroid compounds other than BRs are needed to 
5 restore an ell mutant to a normal growth and development phenotype, as reflected by 
the pleiotropic phenotypes such as stunted roots (Fig. 2) and impaired embryogenesis 
(Fig. 7A-7F). 

To confirm that ELL activity was upstream of DET2 in the sterol biosynthesis 
pathway, a double mutant between ell and det2 was constructed and analyzed. The 
10 phenotype of det2/ell was indistinguishable from ell, further supporting the hypothesis 
that DET2 was epi static to ELL. 



Isolation of Other C-14 S terol Reductase cDNAs and Genomic DNAs 

Based on the C-14 sterol reductase genes and polypeptides described herein, 

15 the isolation of additional plant C-14 sterol reductase coding sequences is made 

possible using standard strategies and techniques that are well known in the art. For 
example, using all or a portion of the amino acid sequence of a C-14 sterol reductase 
polypeptide, one may readily design C-14 sterol reductase-specific oligonucleotide 
probes, including C-14 sterol reductase degenerate oligonucleotide probes (i.e., a 

20 mixture of all possible coding sequences for a given amino acid sequence). These 
oligonucleotides may be based upon the sequence of either DNA strand and any 
appropriate portion of the C-14 sterol reductase sequence (for example, Fig. 14; SEQ 
ID NOS: 2 and 1, respectively; and Fig. 15 (SEQ ID NO: 3). General methods for 
designing and preparing such probes are provided, for example, in Ausubel et al., 

25 1996, Current Protocols in Molecular Biology, Wiley Interscience, New York, and 

Berger and Kimmel, Guide to Molecular Cloning Techniques, 1987, Academic Press, 
New York. These oligonucleotides are useful for C-14 sterol reductase gene isolation, 
either through their use as probes capable of hybridizing to C-14 sterol reductase 
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complementary sequences or as primers for various amplification techniques, for 
example, polymerase chain reaction (PCR) cloning strategies. 

Hybridization techniques and screening procedures are well known to those 
skilled in the art and are described, for example, in Ausubel et al. (supra); Berger and 
5 Kimmel (supra); and Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, New York. If desired, a combination of 
different oligonucleotide probes may be used for the screening of a recombinant DNA 
library. The oligonucleotides may be detectably-labeled using methods known in the 
art and used to probe filter replicas from a recombinant DNA library. Recombinant 
10 DNA libraries are prepared according to methods well known in the art, for example, 
as described in Ausubel et al. (supra), or they may be obtained from commercial 
sources. 

For detection or isolation of closely related C-14 sterol reductase sequences 
having greater than 80% identity, high stringency conditions are preferably used; such 

15 conditions include hybridization at about 65 °C and about 50% formamide, a first 

wash at about 65 °C, about 2X SSC, and 1% SDS, followed by a second wash at about 
65 °C and about 0. 1 % SDS, and 0. IX SSC. Lower stringency conditions for detecting 
C-14 sterol reductase genes having about 40-50% sequence identity to the C-14 sterol 
reductase genes described herein include, for example, hybridization at about 37°C in 

20 the absence of formamide, a first wash at about 37 °C, about 6X SSC, and about 1% 
SDS, and a second wash at about 37°C, about 6X SSC, and about 1% SDS. These 
stringency conditions are exemplary; other appropriate conditions may be determined 
by those skilled in the art. 

As discussed above, C-14 sterol reductase oligonucleotides may also be used as 

25 primers in amplification cloning strategies, for example, using PCR. PCR methods 
are well known in the art and are described, for example, in PCR Technology, Erlich, 
ed., Stockton Press, London, 1989; PCR Protocols: A Guide to Methods and 
Applications, Innis et al., eds., Academic Press, Inc., New York, 1990; and Ausubel et 
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al. (supra). Primers are optionally designed to allow cloning of the amplified product 
into a suitable vector, for example, by including appropriate restriction sites at the 5* 
and 3' ends of the amplified fragment (as described herein). If desired, C-14 sterol 
reductase sequences may be isolated using the PCR "RACE" technique, or Rapid 
5 Amplification of cDNA Ends (see, e.g., Innis et al. (supra)). By this method, 

oligonucleotide primers based on a C-14 sterol reductase sequence are oriented in the 
y and 5 f directions and are used to generate overlapping PCR fragments. These 
overlapping 3 - and 5'-end RACE products are combined to produce an intact full- 
length cDNA. This method is described in Innis et aL (supra)', and Frohman et al., 
10 Proc. Natl. Acad. Sci. USA 85: 8998, 1988. 

Alternatively, any plant cDNA expression library may be screened by 
functional complementation of a yeast C-14 reductase mutant (for example, the erg24 
mutant described by Lorenz and Parks, DNA Cell Biol. 9: 685, 1992) according to 
standard methods. 

15 Useful C-14 sterol reductase sequences may be isolated from any appropriate 

organism. Confirmation of a sequence's relatedness to the C-14 sterol reductase 
polypeptide family may be accomplished by a variety of conventional methods 
including, but not limited to, functional complementation assays and sequence 
comparison. In addition, the activity of any C-14 sterol reductase sequence may be 

20 evaluated according to any of the techniques described herein. 

C- 1 4 Sterol Reductase Polypeptide Expression 

C- 1 4 sterol reductase polypeptides may be produced by transformation of a 
suitable host cell with all or part of a C-14 sterol reductase cDNA (for example, the 
25 cDNA described above) in a suitable expression vehicle or with a plasmid construct 
engineered for increasing the expression of a 
C-14 sterol reductase polypeptide (supra) in vivo. 
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Those skilled in the field of molecular biology will understand that any of a 
wide variety of expression systems may be used to provide the recombinant protein. 
The precise host cell used is not critical to the invention. The C-I4 sterol reductase 
protein may be produced in a prokaryotic host, for example, E. coli, or in a eukaryotic 
5 host, for example, Saccharomyces cerevisiae, mammalian cells (for example, COS 1 
or NIH 3T3 cells), or any of a number of plant cells including, without limitation, 
algae, tree species, ornamental species, temperate fruit species, tropical fruit species, 
vegetable species, legume species, monocots, dicots, or in any plant of commercial or 
agricultural significance. Particular examples of suitable plant hosts include, but are 

10 not limited to, Conifers, Petunia, Tomato, Potato, Tobacco, Arabidopsis, Lettuce, 
Sunflower, Oilseed rape, Flax, Cotton, Sugarbeet, Celery, Soybean, Alfalfa, 
Medicago, Lotus, Vigna, Cucumber, Carrot, Eggplant, Cauliflower, Horseradish, 
Morning Glory, Poplar, Walnut, Apple, Asparagus, Rice, Maize, Millet, Onion, 
Barley, Orchard grass, Oat, Rye, and Wheat. 

1 5 Such cells are available from a wide range of sources including the American 

Type Culture Collection (Rockland, MD); or from any of a number seed companies, 
for example, W. Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. 
(Greenwood, SC), Johnny Seed Co. (Albion, ME), or Northrup King Seeds 
(Harstville, SC). Descriptions and sources of useful host cells are also found in Vasil 

20 I.K., Cell Culture and Somatic Cell Genetics of Plants, Vol I, II, III Laboratory 

Procedures and Their Applications Academic Press, New York, 1984; Dixon, R.A., 
Plant Cell Culture-A Practical Approach, IRL Press, Oxford University, 1985; Green 
et aL, Plant Tissue and Cell Culture, Academic Press, New York, 1987; and Gasser 
and Fraley, Science 244: 1293, 1989. 

25 For prokaryotic expression, DNA encoding a C-l 4 sterol reductase polypeptide 

is carried on a vector operably linked to control signals capable of effecting expression 
in the prokaryotic host. If desired, the coding sequence may contain, at its 5' end, a 
sequence encoding any of the known signal sequences capable of effecting secretion 
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of the expressed protein into the periplasmic space of the host cell, thereby facilitating 
recovery of the protein and subsequent purification. Prokaryotes most frequently used 
are various strains of E. coli\ however, other microbial strains may also be used. 
Plasmid vectors are used which contain replication origins, selectable markers, and 
5 control sequences derived from a species compatible with the microbial host. 

Examples of such vectors are found in Pouwels et al. (supra) or Ausubel et al. (supra). 
Commonly used prokaryotic control sequences (also referred to as "regulatory 
elements") are defined herein to include promoters for transcription initiation, 
optionally with an operator, along with ribosome binding site sequences. Promoters 

10 commonly used to direct protein expression include the beta- lactamase (penicillinase), 
the lactose (lac) (Chang et al., Nature 198: 1056, 1977), the tryptophan (Trp) (Goeddel 
et al., Nucl. Acids Res. 8: 4057, 1980), and the ia£ promoter systems, as well as the 
lambda-derived P L promoter and N-gene ribosome binding site (Simatake et al., 
Nature 292: 128, 1981). 

15 One particular bacterial expression system for C-14 sterol reductase 

polypeptide production is the E. coli pET expression system (Novagen, Inc., Madison 
WI). According to this expression system, DNA encoding a C-14 sterol reductase 
polypeptide is inserted into a pET vector in an orientation designed to allow 
expression. Since the C-14 sterol reductase gene is under the control of the T7 

20 regulatory signals, expression of C- 14 sterol reductase is induced by inducing the 

expression of T7 RNA polymerase in the host cell. This is typically achieved using 
host strains which express T7 RNA polymerase in response to IPTG induction. Once 
produced, recombinant C-14 sterol reductase polypeptide is then isolated according to 
standard methods known in the art, for example, those described herein. 

25 Another bacterial expression system for C-14 sterol reductase polypeptide 

production is the pGEX expression system (Pharmacia). This system employs a GST 
gene fusion system which is designed for high-level expression of genes or gene 
fragments as fusion proteins with rapid purification and recovery of functional gene 
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products. The protein of interest is fused to the carboxyl terminus of the glutathione 
S-transferase protein from Schistosoma japonicum and is readily purified from 
bacterial lysates by affinity chromatography using Glutathione Sepharose 4B. Fusion 
proteins can be recovered under mild conditions by elution with glutathione. 
5 Cleavage of the glutathione S-transferase domain from the fusion protein is facilitated 
by the presence of recognition sites for site-specific proteases upstream of this 
domain. For example, proteins expressed in pGEX-2T plasmids may be cleaved with 
thrombin; those expressed in pGEX-3X may be cleaved with factor Xa. 

For eukaryotic expression, the method of transformation or transfection and the 

10 choice of vehicle for expression of the C-14 sterol reductase polypeptide will depend 
on the host system selected. Transformation and transfection methods are described, 
e.g., in Ausubel et al. (supra); Weissbach and Weissbach, Methods for Plant 
Molecular Biology, Academic Press, 1989; Gelvin et al., Plant Molecular Biology 
Manual, Kluwer Academic Publishers, 1990; Kindle, K., Proc. Natl Acad. ScL, U.S. A 

1 5 87: 1228, 1990; Potrykus, I., Annu. Rev. Plant Physiol. Plant Mol. Biology 42: 205, 
1991; and BioRad (Hercules, CA) Technical Bulletin #1687 (Biolistic Particle 
Delivery Systems). Expression vehicles may be chosen from those provided, e.g., in 
Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al., 1985, Supp. 1987); 
Gasser and Fraley (supra); Clontech Molecular Biology Catalog (Catalog 1992/93 

20 Tools for the Molecular Biologist, Palo Alto, CA); and the references cited above. 

Most preferably, an C-14 sterol reductase polypeptide is produced by a stably- 
transfected plant cell line, a transiently-transfected plant cell line, or by a transgenic 
plant. A number of vectors suitable for stable transfection of plant cells or for the 
establishment of transgenic plants are available to the public; such vectors are 

25 described in Pouwels et al. (supra), Weissbach and Weissbach (supra), and Gelvin et 
al. (supra). Methods for constructing such cell lines are described in, e.g., Weissbach 
and Weissbach (supra), and Gelvin et al. (supra). Typically, plant expression vectors 
include (1) a cloned plant gene under the transcriptional control of 5 ? and 3* regulatory 
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sequences and (2) a dominant selectable marker. Such plant expression vectors may 
also contain, if desired, a promoter regulatory region (for example, one conferring 
inducible or constitutive, environmentally- or developmentally-regulated, or cell- or 
tissue-specific expression), a transcription initiation start site, a ribosome binding site, 
5 an RNA processing signal, a transcription termination site, and/or a polyadenylation 
signal. 

Alternatively, the C-14 sterol reductase polypeptide may be produced using a 
transient expression system (e.g., the maize transient expression system described by 
Sheen, Plant Cell!: 1027, 1990). 
10 Once the desired C-14 sterol reductase nucleic acid sequences is obtained, it 

may be manipulated in a variety of ways known in the art. For example, where the 
sequence involves non-coding flanking regions, the flanking regions may be subjected 
to mutagenesis. 

The C-14 sterol reductase DNA sequence of the invention may, if desired, be 
15 combined with other DNA sequences in a variety of ways. The C-14 sterol reductase 
DNA sequence of the invention may be employed with all or part of the gene 
sequences normally associated with the C-14 sterol reductase protein. In its 
component parts, a DNA sequence encoding a C-14 sterol reductase protein is 
combined in a DNA construct having a transcription initiation control region capable 
20 of promoting transcription and translation in a host cell. 

In general, the constructs will involve regulatory regions functional in plants 
which provide for modified production of C-14 sterol reductase protein as discussed 
herein. The open reading frame coding for the C-14 sterol reductase protein or 
functional fragment thereof will be joined at its 5' end to a transcription initiation 
25 regulatory region such as the sequence naturally found in the 5' upstream region of the 
C-14 sterol reductase structural gene. Numerous other transcription initiation regions 
are available which provide for constitutive or inducible regulation. 
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For applications where developmental, cell, tissue, hormonal, or environmental 
expression is desired, appropriate 5' upstream non-coding regions are obtained from 
other genes, for example, from genes regulated during meristem development, seed 
development, embryo development, or leaf development. 
5 Regulatory transcript termination regions may also be provided in DNA 

constructs of this invention as well. Transcript termination regions may be provided 
by the DNA sequence encoding the C-14 sterol reductase protein or any convenient 
transcription termination region derived from a different gene source. The transcript 
termination region will contain preferably at least 1-3 kb of sequence 3' to the 

10 structural gene from which the termination region is derived. Plant expression 
constructs having C-14 sterol reductase as the DNA sequence of interest for 
expression (in either the sense or antisense orientation) may be employed with a wide 
variety of plant life, particularly plant life involved in the production of storage 
reserves (for example, those involving carbon and nitrogen metabolism). Such 

15 genetically-engineered plants are useful for a variety of industrial and agricultural 
applications as discussed below. Importantly, this invention is applicable to 
dicotyledons and monocotyledons, and will be readily applicable to any new or 
improved transformation or regeneration method. 

An example of a useful plant promoter according to the invention is a 

20 caulimovirus promoter, for example, a cauliflower mosaic virus (CaMV) promoter. 

These promoters confer high levels of expression in most plant tissues, and the activity 
of these promoters is not dependent on virally encoded proteins. CaMV is a source for 
both the 35S and 19S promoters. In most tissues of transgenic plants, the CaMV 35S 
promoter is a strong promoter (see, e.g., Odell et al., Nature 313: 810 1985). The 

25 CaMV promoter is also highly active in monocots (see, e.g., Dekeyser et aL, Plant 
Cell!: 591, 1990; Teradaand Shimamoto, Mol Gen. Genet 220: 389, 1990). 
Moreover, activity of this promoter can be further increased (i.e., between 2-10 fold) 
by duplication of the CaMV 35S promoter (see e.g., Kay et al., Science 236: 1299, 
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1987; Ow et ah, Proc. Natl Acad. ScL, U.S.A. 84: 4870, 1987; and Fang et al., Plant 
Cell\: 141, 1989). 

Other useful plant promoters include, without limitation, the nopaline synthase 
promoter (An et al., Plant Physiol. 88: 547, 1988) and the octopine synthase promoter 
5 (Fromm et al, Plant Cell 1: 977, 1989). 

For certain applications, it may be desirable to produce the C-14 sterol 
reductase gene product in an appropriate tissue, at an appropriate level, or at an 
appropriate developmental time. For this purpose, there are an assortment of gene 
promoters, each with its own distinct characteristics embodied in its regulatory 

10 sequences, shown to be regulated in response to the environment, hormones, and/or 
developmental cues. These include gene promoters that are responsible for heat- 
regulated gene expression (see, e.g., Callis et al, Plant Physiol. 88: 965, 1988; 
Takahashi and Komeda, Mol. Gen. Genet. 219: 365, 1989; and Takahashi et al. Plant 
J. 2: 751, 1992), light-regulated gene expression (e.g., the pea rbcSSA described by 

15 Kuhlemeier et ah, Plant Cell 1 : 47 1 , 1989; the maize rbcS promoter described by 

Schaffher and Sheen, Plant Cell 3: 997, 1991; or the cholorphyll a/b-binding protein 
gene found in pea described by Simpson et al., EMBO J. 4: 2723, 1985), hormone- 
regulated gene expression (for example, the abscisic acid (ABA) responsive sequences 
from the Em gene of wheat described by Marcotte et al., Plant Cell 1 : 969, 1989; the 

20 ABA-inducible HVA1 and HVA22, and rd29A promoters described for barley and 
Arabidopsis by Straub et al., Plant Cell 6: 617, 1994, Shen et al., Plant Cell 7: 295, 
1995; and wound-induced gene expression (for example, of wunl described by 
Siebertz et al., Plant Cell 1: 961, 1989), or organ-specific gene expression (for 
example, of the tuber-specific storage protein gene described by Roshal et al., EMBO 

25 J. 6: 11 55, 1987; the 23-kDa zein gene from maize described by Schernthaner et al., 
EMBO J, 7: 1249, 1988; or the French bean 6-phaseolin gene described by Bustos et 
al, Plant Cell 1: 839, 1989). 
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Plant expression vectors may also optionally include RNA processing signals, 
e.g, introns, which have been shown to be important for efficient RNA synthesis and 
accumulation (Callis et al., Genes and Dev. 1: 1 183, 1987). The location of the RNA 
splice sequences can dramatically influence the level of transgene expression in plants. 
5 In view of this fact, an intron may be positioned upstream or downstream of a C- 14 

sterol reductase polypeptide-encoding sequence in the transgene to modulate levels of 
gene expression. 

In addition to the aforementioned 5 f regulatory control sequences, the 
expression vectors may also include regulatory control regions which are generally 

10 present in the 3* regions of plant genes (Thornburg et al., Proc. Natl. Acad. Sci. U.S.A. 
84: 744, 1987; An et aL, Plant Cell 1: 1 15, 1989). For example, the 3' terminator 
region may be included in the expression vector to increase stability of the mRNA. 
One such terminator region may be derived from the PI-II terminator region of potato. 
In addition, other commonly used terminators are derived from the octopine or 

15 nopaline synthase signals. 

The plant expression vector also typically contains a dominant selectable 
marker gene used to identify those cells that have become transformed. Useful 
selectable genes for plant systems include genes encoding antibiotic resistance genes, 
for example, those encoding resistance to hygromycin, kanamycin, bleomycin, G418, 

20 streptomycin, or spectinomycin. Genes required for photosynthesis may also be used 
as selectable markers in photosynthetic-deficient strains. Alternatively, the green- 
fluorescent protein from the jellyfish Aequorea victoria may be used as a selectable 
marker (Sheen et aL, Plant J. 8:777, 1995; Chiu et al., Current Biology 6: 325, 1996). 
Finally, genes encoding herbicide resistance may be used as selectable markers; useful 

25 herbicide resistance genes include the bar gene encoding the enzyme phosphinothricin 
acetyltransferase and conferring resistance to the broad spectrum herbicide Basta® 
(Hoechst AG, Frankfurt, Germany). 
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Efficient use of selectable markers is facilitated by a determination of the 
susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed 
cells. Some useful concentrations of antibiotics for tobacco transformation include, 
5 e.g., 75-100 ftg/ml (kanamycin), 20-50 ng/ml (hygromycin), or 5-10 jig/ml 

(bleomycin). A useful strategy for selection of transformants for herbicide resistance 
is described, e.g., by Vasil et al., supra. 

It should be readily apparent to one skilled in the art of molecular biology, 
especially in the field of plant molecular biology, that the level of gene expression is 
10 dependent, not only on the combination of promoters, RNA processing signals, and 

terminator elements, but also on how these elements are used to increase the levels of 
selectable marker gene expression. 

Plant Transformation 

15 Upon construction of the plant expression vector, several standard methods are 

available for introduction of the vector into a plant host, thereby generating a 
transgenic plant. These methods include (1) Agrobacterium-mediated transformation 
(A. tumefaciens or A. rhizogenes) (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, vol 6, PWJ Rigby, ed, London, Academic Press, 1987; and Lichtenstein, 

20 CP., and Draper, J,. In: DNA Cloning, Vol II, D.M. Glover, ed, Oxford, IRI Press, 
1985)), (2) the particle delivery system (see, e.g., Gordon-Kamm et aL, Plant Cell 2: 
603 (1990); or BioRad Technical Bulletin 1687, supra), (3) microinjection protocols 
(see, e.g., Green et al., supra), (4) polyethylene glycol (PEG) procedures (see, e.g., 
Draper et aL, Plant Cell PhysioL 23: 451, 1982; or e.g., Zhang and Wu, Theor. AppL 

25 Genet. 76: 835, 1988), (5) liposome-mediated DNA uptake (see, e.g., Freeman et aL, 
Plant Cell Physiol. 25: 1353, 1984), (6) electroporation protocols (see, e.g., Gelvin et 
aL, supra; Dekeyser et aL, supra; Fromm et aL, Nature 319: 791, 1986; Sheen Plant 
Cell 2: 1027, 1990; or Jang and Sheen Plant Cell 6: 1665, 1994), and (7) the vortexing 
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method (see, e.g., Kindle supra). The method of transformation is not critical to the 
invention. Any method which provides for efficient transformation may be employed. 
As newer methods are available to transform crops or other host cells, they may be 
directly applied. 

5 The following is an example outlining one particular technique, an 

Agrobacterium-medizted plant transformation. By this technique, the general process 
for manipulating genes to be transferred into the genome of plant cells is carried out in 
two phases. First, cloning and DNA modification steps are carried out in E. coli, and 
the plasmid containing the gene construct of interest is transferred by conjugation or 

10 electroporation into Agrobacterium. Second, the resulting Agrobacterium strain is 
used to transform plant cells. Thus, for the generalized plant expression vector, the 
plasmid contains an origin of replication that allows it to replicate in Agrobacterium 
and a high copy number origin of replication functional in E. coli. This permits facile 
production and testing of transgenes in E. coli prior to transfer to Agrobacterium for 

15 subsequent introduction into plants. Resistance genes can be carried on the vector, 

one for selection in bacteria, for example, streptomycin, and another that will function 
in plants, for example, a gene encoding kanamycin resistance or herbicide resistance. 
Also present on the vector are restriction endonuclease sites for the addition of one or 
more transgenes and directional T-DNA border sequences which, when recognized by 

20 the transfer functions of Agrobacterium^ delimit the DNA region that will be 
transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell 
tungsten microprojectiles on which cloned DNA is precipitated. In the Biolistic 
Apparatus (Bio-Rad) used for the shooting, a gunpowder charge (22 caliber Power 

25 Piston Tool Charge) or an air-driven blast drives a plastic macroprojectile through a 
gun barrel. An aliquot of a suspension of tungsten particles on which DNA has been 
precipitated is placed on the front of the plastic macroprojectile. The latter is fired at 
an acrylic stopping plate that has a hole through it that is too small for the 
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macroprojectile to pass through. As a result, the plastic macroprojectile smashes 
against the stopping plate, and the tungsten microprojectiles continue toward their 
target through the hole in the plate. For the instant invention the target can be any 
plant cell, tissue, seed, or embryo. The DNA introduced into the cell on the 
5 microprojectiles becomes integrated into either the nucleus or the chloroplast. 

In general, transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene 
expression studies in plants and to produce improved plant varieties of agricultural or 
commercial interest. 

10 

Transgenic Plant Regeneration 

Plant cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant 
tissue culture techniques. It is well known in the art that various cells, tissues, and 

15 organs from almost any plant can be successfully cultured to regenerate an entire 
plant; such techniques are described, e.g., in Vasil supra; Green et al., supra; 
Weissbach and Weissbach, supra; and Gelvin et ah, supra. 

In one particular example, a cloned C-14 sterol reductase polypeptide or an 
antisense construct under the control of the 35S CaMV promoter and the nopaline 

20 synthase terminator and carrying a selectable marker (for example, kanamycin 
resistance) is transformed into Agrobacterium. Transformation of leaf discs (for 
example, of tobacco leaf discs), with vector-containing Agrobacterium is carried out 
as described by Horsch et al. {Science 221 \ 1229, 1985). Putative transformants are 
selected after a few weeks (for example, 3 to 5 weeks) on plant tissue culture media 

25 containing kanamycin (e.g. 100 ng/ml). Kanamycin-resistant shoots are then placed 
on plant tissue culture media without hormones for root initiation. Kanamycin- 
resistant plants are then selected for greenhouse growth. If desired, seeds from self- 
fertilized transgenic plants can then be sowed in a soil-less medium and grown in a 
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greenhouse. Kanamycin-resistant progeny are selected by sowing surfaced sterilized 
seeds on hormone-free kanamycin-containing media. Analysis for the integration of 
the transgene is accomplished by standard techniques (see, for example, Ausubel et al. 

supra; Gelvin et al. supra). 
5 Transgenic plants expressing the selectable marker are then screened for 

transmission of the transgene DNA by standard immunoblot and DNA detection 
techniques. Each positive transgenic plant and its transgenic progeny are unique in 
comparison to other transgenic plants established with the same transgene. Integration 
of the transgene DNA into the plant genomic DNA is in most cases random, and the 

10 site of integration can profoundly affect the levels and the tissue and developmental 
patterns of transgene expression. Consequently, a number of transgenic lines are 
usually screened for each transgene to identify and select plants with the most 
appropriate expression profiles. 

Transgenic lines are evaluated for levels of transgene expression. Expression at 

15 the RNA level is determined initially to identify and quantitate expression-positive 
plants. Standard techniques for RNA analysis are employed and include PCR 
amplification assays using oligonucleotide primers designed to amplify only transgene 
RNA templates and solution hybridization assays using transgene-specific probes (see, 
e.g., Ausubel et al., supra). The RNA-positive plants are then analyzed for protein 

20 expression by Western immunoblot analysis using C-14 sterol reductase specific 
antibodies (see, e.g., Ausubel et al., supra). In addition, in situ hybridization and 
immunocytochemistry according to standard protocols can be done using transgene- 
specific nucleotide probes and antibodies, respectively, to localize sites of expression 

within transgenic tissue. 
25 Once the recombinant C-14 sterol reductase protein is expressed in any cell or 

in a transgenic plant (for example, as described above), it may be isolated, e.g., using 
affinity chromatography. In one example, an anti-C14 sterol reductase antibody (e.g., 
produced as described in Ausubel et al., supra, or by any standard technique) may be 
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attached to a column and used to isolate the polypeptide. Lysis and fractionation of C- 
14 sterol reductase-producing cells prior to affinity chromatography may be performed 
by standard methods (see, e.g., Ausubel et al., supra). Once isolated, the recombinant 
protein can, if desired, be further purified, for example, by high performance liquid 
5 chromatography (see, e.g., Fisher, Laboratory Techniques In Biochemistry And 
Molecular Biology, eds., Work and Burdon, Elsevier, 1980). 

Polypeptides of the invention, particularly short C-14 sterol reductase protein 
fragments, can also be produced by chemical synthesis (e.g., by the methods described 
in Solid Phase Peptide Synthesis, 2nd ed M 1984 The Pierce Chemical Co., Rockford, 
10 IL). 

These general techniques of polypeptide expression and purification can also be 
used to produce and isolate useful C-14 sterol reductase fragments or analogs. 



Antibodies 

15 C-14 sterol reductases described herein (or immunogenic fragments or analogs) 

may be used to raise antibodies useful in the invention; such polypeptides may be 
produced by recombinant or peptide synthetic techniques (see, e.g., Solid Phase 
Peptide Synthesis, 2nd ed., 1984, Pierce Chemical Co., Rockford, IL; Ausubel et al., 
supra). The peptides may be coupled to a carrier protein, such as KLH as described in 

20 Ausubel et al, supra. The KLH-peptide is mixed with Freund's adjuvant and injected 
into guinea pigs, rats, or preferably rabbits. Antibodies may be purified by peptide 
antigen affinity chromatography. 

Monoclonal antibodies may be prepared using the C-14 sterol reductase 
polypeptides described above and standard hybridoma technology (see, e.g., Kohler et 

25 al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:51 1, 1976; Kohler et ah, 
Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T 
Cell Hybridomas, Elsevier, NY, 1981; Ausubel et al., supra). 
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Once produced, polyclonal or monoclonal antibodies are tested for specific C- 
14 sterol reductase recognition by Western blot or immunoprecipitation analysis (by 
the methods described in Ausubel et al., supra). Antibodies which specifically 
recognize C-14 sterol reductases are considered to be useful in the invention; such 
antibodies may be used, e.g., in an immunoassay to monitor the level of C-14 sterol 
reductase produced by a plant. 



Use 

Because the present invention provides for the genetic manipulation of a plant 
sterol biosynthetic pathway, this invention described is useful for a variety of 
agricultural and commercial purposes including, but not limited to, increasing crop 
yields, improving crop and ornamental quality, and reducing agricultural production 
costs. For example, the methods, DNA constructs, proteins, and transgenic plants 
described herein are useful for improving a number of fruit and vegetable 
characteristics including, but not limited to, texture, size, nutritional content, 
modification of sterol composition, disease and insect resistance, and ripening 
processes. In addition, genetic manipulation of plant sterol composition (for example, 
seed sterol composition) is useful for improving food quality and oil stability, and 
regulating the formation of compounds having an ti -nutritional properties. 

In one particular example, antisense C-14 sterol reductase sequences are useful 
for reducing the expression of C-14 sterol reductase expression in a transgenic plant. 
Such reduced expression of C-14 sterol reductase provides a means for increasing the 
life-span of such plants. Increased life-span extends reproductive period, delays 
senescence, and increases branch number for high productivity and yield. In addition, 
transgenic plants expressing antisense C-14 sterol reductase are useful for producing 
plants having reduced and more compact proportions. Such plants require less space 
and land requirements for their growth, and are more convenient and efficient to 
harvest. 
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Overproduction of the C-14 sterol reductase in transgenic plants is useful for 
enhancing the production of steroid compounds having a variety of medicinal or 
agricultural applications. For example, overproduction of mammalian steroid 
hormones in plants offers an inexpensive means for producing such hormones. 
5 In addition, C-14 sterol reductase polypeptides disclosed herein are useful for 

the development of enzyme inhibitors of the sterol biosynthetic pathway. 

Other Embodiments 

In other embodiments, the invention includes any protein which is substantially 

10 identical to a crucifer C-14 sterol reductase polypeptide (Fig. 10; SEQ ID NO:l); such 
homologs include other substantially pure naturally-occurring plant C-14 sterol 
reductase proteins as well as allelic variants; natural mutants; induced mutants; 
proteins encoded by DNA that hybridizes to the C- 1 4 sterol reductase DNA sequence 
of Fig. 14 (SEQ ID NO: 2) under high stringency conditions or, less preferably, under 

15 low stringency conditions (e.g., washing at 2X SSC at 37°C with a probe length of at 
least 10-15 nucleotides), both as described herein; and proteins specifically bound by 
antisera directed to a C-14 sterol reductase polypeptide. The term also includes 
chimeric polypeptides that include a C-14 sterol reductase portion. 

The invention further includes analogs of any naturally-occurring plant C-14 

20 sterol reductase polypeptide. Analogs can differ from the naturally-occurring C-14 
sterol reductase protein by amino acid sequence differences, by post-translational 
modifications, or by both. Analogs of the invention will generally exhibit at least 
30%, more preferably 40%, and most preferably 50% or even 80-95% identity with all 
or part of a naturally-occurring plant C-14 sterol reductase amino acid sequence. The 

25 length of sequence comparison is at least 15 amino acid residues, preferably at least 25 
amino acid residues, and more preferably more than 35 amino acid residues. 
Modifications include in vivo and in vitro chemical derivatization of polypeptides, 
e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications 
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may occur during polypeptide synthesis or processing or following treatment with 
isolated modifying enzymes. Analogs can also differ from the naturally-occurring C- 
14 sterol reductase polypeptide by alterations in primary sequence. These include 
genetic variants, both natural and induced (for example, resulting from random 
5 mutagenesis by irradiation or exposure to ethanemethylsulfate or by site-specific 

mutagenesis as described in Sambrook, Fritsch and Maniatis, Molecular Cloning: A 
Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et al., supra). Also 
included are cyclized peptides, molecules, and analogs which contain residues other 
than L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino 

10 acids, e.g., P or y amino acids. 

In addition to full-length polypeptides, the invention also includes C-14 sterol 
reductase polypeptide fragments. As used herein, the term "fragment," means at least 
20 contiguous amino acids, preferably at least 30 contiguous amino acids, more 
preferably at least 50 contiguous amino acids, and most preferably at least 60 to 80 or 

15 more contiguous amino acids. Fragments of C-14 sterol reductase polypeptides can be 
generated by methods known to those skilled in the art or may result from normal 
protein processing (e.g., removal of amino acids from the nascent polypeptide that are 
not required for biological activity or removal of amino acids by alternative mRNA 
splicing or alternative protein processing events). 

20 Furthermore, the invention includes nucleotide sequences that facilitate specific 

detection of a C-14 sterol reductase nucleic acid. Thus, C-14 sterol reductase 
sequences described herein (e.g., SEQ ID NO: 2 and 3) or portions thereof may be 
used as probes to hybridize to nucleotide sequences from other plants (e.g., dicots, 
monocots, gymnosperms, and algae) by standard hybridization techniques under 

25 conventional conditions. Sequences that hybridize to a C-14 sterol reductase coding 
sequence or its complement and that encode a C-14 sterol reductase are considered 
useful in the invention. As used herein, the term "fragment," as applied to nucleic acid 
sequences, means at least 5 contiguous nucleotides, preferably at least 10 contiguous 
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nucleotides, more preferably at least 20 to 30 contiguous nucleotides, and most 
preferably at least 40 to 80 or more contiguous nucleotides. Fragments of C-14 sterol 
reductase nucleic acid sequences can be generated by methods known to those skilled 
in the art. 

5 All publications and patent applications mentioned in this specification are 

herein incorporated by reference to the same extent as if each independent publication 
or patent application was specifically and individually indicated to be incorporated by 
reference. 

Other embodiments are within the following claims. 

0 

What is claimed is: 
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C tei ms 



1 . A substantially pure plant C-14 sterol reductase polypeptide. 



2. A substantially pure polypeptide comprising an amino acid sequence 
substantially identical to the sequence shown in Fig. 14 (SEQ ID NO: 1). 



3. The polypeptide of claim 1 or 2, wherein said polypeptide comprises the 
amino acid sequence shown in Fig. 14 (SEQ ID NO: 1). 



4. The polypeptide of claim 1 or 2, wherein said polypeptide is from a dicot. 



5. The polypeptide of claim 4, wherein said dicot is a crucifer. 



15 6. A purified DNA encoding a C-14 sterol reductase polypeptide. 



7. A purified DNA comprising a sequence substantially identical to the DNA 
sequence shown in Fig. 14 (SEQ ID NO: 2). 



20 8. The purified DNA of claim 6 or 7, wherein said DNA comprises the 

sequence shown in Fig. 14 (SEQ ID NO: 2). 



9. The purified DNA of claim 6 or 7, wherein said DNA encodes a polypeptide 
which has an amino acid sequence substantially identical to that shown in Fig. 14 
25 (SEQ ID NO: 1). 



1 0. The purified DNA of claim 9, wherein said DNA encodes a polypeptide 
which has the amino acid sequence shown in Fig. 14 (SEQ ID NO: 1). 
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1 1 . The purified DNA of claim 6 or 7, wherein said DNA is from a dicot. 

12. The purified DNA of claim 1 1 , wherein said dicot is a crucifer. 

13. A vector comprising the purified DNA of claim 6 or 7. 

14. A cell comprising the purified DNA of claim 6 or 7. 

15. A method of producing a recombinant C-14 sterol reductase polypeptide 
comprising 

providing a cell transformed with purified DNA encoding a C-14 sterol 
reductase polypeptide positioned for expression in said cell, and 

culturing said transformed cell under conditions for expressing said DNA. 

16. The method of claim 15, wherein said method further comprises recovering 
said recombinant C-14 sterol reductase polypeptide. 

17. The method of claim 15, wherein said cell is a plant cell. 

1 8. A recombinant C-14 sterol reductase produced by the method of claim 15. 

19. An isolated antibody which specifically recognizes and binds a plant C-14 
sterol reductase polypeptide. 

20. A transgenic plant which contains DNA encoding a C-14 sterol reductase 
polypeptide integrated into the genome of said plant, wherein said DNA is expressed 
in said transgenic plant. 
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21. A transgenic plant which contains DN A encoding an amino acid sequence 
substantially identical to the sequence shown in Fig. 14 (SEQ ID NO: 1) 
integrated into the genome of said plant, wherein said DNA is expressed in said 
transgenic plant. 

22. A seed from a transgenic plant of claim 20 or 21 . 



23. A cell from a transgenic plant of claim 20 or 2 1 . 



10 24. A method of detecting a C-14 sterol reductase gene in a plant cell 

comprising: 

contacting the purified DNA of claim 6 or 7 or a portion jhereof greater than 
about 12 nucleotides in length with a preparation of genomic DNA from said plant cell 
under hybridization conditions providing detection of DNA sequences having about 
15 40% or greater sequence identity to SEQ ID NO: 2. 



25. A method of isolating a C- 14 sterol reductase gene or portion thereof, said 
method comprising 

(a) amplifying said C-14 sterol reductase gene using oligonucleotide primers, 
20 wherein said primers each have regions of complementarity to opposite DNA strands 

in a region of SEQ ID NO: 2; and 

(b) isolating said C-14 sterol reductase gene or portion thereof. 



26. A method for reducing the level of a C-14 sterol reductase polypeptide in a 
25 transgenic plant cell, said method comprising expressing in a plant cell an antisense C- 
14 sterol reductase nucleic acid sequence. 
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27. The method of claim 26, wherein said antisense C-14 sterol reductase 
nucleic acid sequence is encoded by a transgene integrated into the genome of said 
transgenic plant cell. 

28. The method of claim 26, wherein said C-14 sterol reductase sequence is 
SEQ ID NO: 2 or SEQ ID NO: 3. 

29. The method of claim 26, wherein said method further comprises growing a 
transgenic plant from said transgenic plant cell, whereby the level of the C-14 sterol 
reductase polypeptide is reduced in said transgenic plant. 

30. A method for increasing the level of a C-14 sterol reductase in a transgenic 
plant cell, comprising expressing in said transgenic plant cell a nucleic acid sequence 
encoding a polypeptide substantially identical to SEQ ID NO: 1 . 

31. A transgenic plant comprising purified DNA encoding a plant C-14 sterol 
reductase polypeptide, said DNA comprising a knockout mutation in said C-14 sterol 
reductase sequence. 
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Fig. 5 
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Fig. 6A 




Fig. 6B 
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Fig. 7A 




Fig. 7D 




Fig. 7B 
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Fig. 7C 
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Fig. 8A 
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Fig. 9A Fig. 9D 
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SEQ XD NO 
SEQ XD NO 
SEQ XD NO 
SEQ XD NO 
SEQ XD NO 



1 Ath 

4 Erg24 

5 Pambe 

6 St Sl . MK STVKK SAP 

7 Ygl022 MAKDNSEKLQ VQGKKKKSKQ 



.MVSA LNFRTT^ 
MAKGA VKKEKJ^gYJwjj 



FVNFLPQGKW LKPNEI« 



134 

3! 






Ath 

gngcsiSiEv fti. 

Pambe PJEVTVLTTV 
Stsl aiAIMlSBfSC LMY • 
Ygl022 VlSMLI^lSl- LMY- 

Ath QSVYVLV «[]}. . 

Erg24 KPf^YYLOIR ELVfTVYCli. . 

Pambe KsHxLYL E. . 

Stsl GHYIYVGAYP TRYAFLVftWS 
Ygl022 YQQJVLENGIP EKYDWTlESLT 



Ath 
Erg24 



Stsl 
Ygl022 

Ath f*LS2tt 
Erg24 JCIIi 
Pambe TOFA? 
Stsl * 
Ygl022 

Ath jjLl 
Erg24 

Pambe wILg* 
Stsl "ptSK 
Ygl022 

Ath 





AX. . .WSETT 
AM. . .WlfflYV 

LVTLAL 
FLA 

YU^TPCSgOS 



MLLDMDLGV gLPS3 14 

QMERPDYFIK GFPQNFDXVE JWNGI 60 
EEGCPA KFSK .XSHXFKKTP ffFDQ. 63 
KFNDSQFIKP ESFTIAQFQN FFRTjJJ 62 
EFYHGKVALP K. .AGESWMH FXKHfl 87 



WYGIj 
WFSTl 
PC 

FWVFQIIFYY 

LVAgLGICAK LGIVSPLWA Ergle 87 
vlaHrwkJ?td GQLPELQYLY E2*HVS 146 
GWCTSXYIiL GA.S CMEFX W ENj3LQ 138 
TXVj3tiAV[5JV THVFPITTFI [wgpP 149 
TTTLVLVSHF TNLFKLYVTI Er|§3R 174 




Stsl 



Ygl022 ^®Ty!^VT PQLGV 



Ath 
Erg24 



Stsl 
Ygl022 

Ath 



Stsl 
Ygl022 



Ygl022 
Ath 



Pambe 
Stsl 
Ygl022 




Ath V. . .33 



553424 
S3IF453 
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SEQ ID NO; 
SEQ ID NO: 
SEQ ID NO: 



1 Ath 

8 Chicken 

9 Human 

Ath 
Chicken 
Human 

Ath 
Chicken 
Human 




YKDGTEIigLK 
YKDGTEL^LK 




RSRSRSPGRP 
RSRSRSPGRP 



KlEHKKDKKK I Id 
gpQADIKEAR REV) 



rrs 




0 

78 
80 



. 0 
I. 117 
t 120 



Ath . 
Chicken . 
Human I 




S9HLLE 

hB.nt 




DVgM 156 
SQ3S 159 



Ath . 
Chicken ER 
Human SYIA 




Ath 
Chicken 
Human 

Ath 
Chicken 
Human 

Ath 
Chicken 
Human 



Ath VSP 
Chicken 



D^Fj^ATVIi 
I^3j23VFIiF 




YlgVAGEILP 35 

5g5sj2S2sa 269 

iH/ggSgffl 277 



Ath 
Chicken 
Human 

Ath M^LL 
Chicken m^STO 




Human 

Ath YF 
Chicken 
Human 

Ath 
Chicken 
Human 

Ath 
Chicken 
Human 

Ath 
Chicken 
Human 

Ath 
Chick n 
Human 

Ath 369 

Chicken TWSVCYLGFK H637 
Human 615 



CAKlSe 75 
FQF 309 
Q(gV 317 

SNKGS 115 
346 
354 

.VSA®M 153 
386 
393 

190 
426 
433 



I^LEH! 



369 

CKLKYTSHLC 626 
. . 615 
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SEQ ID NO: 2 

CTGAAATTAAACAAAGCGAGAAAAGGCGATACAAAC 
1 + + + + + + 60 

GACTTTAATTTGTTTCGCTCTTTTCCGCTATGTOT 

c 



TTTGAAAATCCTTCTTCTGCTTAATGC 

61 + + + + + + 120 

AAACTTTTAGGAAGAAC^ 

c SEQ ID NO: 1 MLLDMDLGVLLPS- 



CATTGCAATCTGTTTATGTGCTGGTGTTTTACTTC 

121 + + + + + + 180 

GTAACGTTAGACAAATACACGACCACAA 

LQSVYVLVFYFVYLAVAGEI- 



TTCTCCCCGGGAAAGTTATTCGCGGCGTCCTTTTATCAGATGGCTC 

181 + + + + + + 240 

AAGAGGGGCCCTTTCAATAAGCGCCGCAGGAAAATAGTCTACC 

c LPGKVI RGVLLSDGSQLRYR- 
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GATGCAATGGTCTATTGGCACTAATATTGTTGGTAGCTATTTTGGGAATC 
241 + + + + + + 300 

CTACGTTACCAGATAACCGTGATTATAACAACCATC 
c CNGLLALILLVAILGICAKL- 



TTGGCATTGTATCACCTCTTGTGGTT 

301 + + + + + + 360 

AACCGTAACATAGTGGAGAACACCAACGCCTATCTcCTGAACTcAATGAGAGTCGATGAA 

c GIVSPLVVADRGLELLSATF- 



TTATTTTCTGTGTTTTGGTGACATTAgCATTGTATGTTACT 

361 + + + + + + 420 

AATAAAAGACACAAAACCACTGTAATcGTAACATACA 

C IFCVLVTLALYVTGRSSSNK- 



AggGtTctTCCCTAAAgCCTCATGTCTCAgGAAATCTTGTACATGACTGGTGGTTTGGAA 

421 + + + + + + 480 

TccCaAgaAGGGATTTcGGAGTACAGAGTcCTTTAGAACATGTACTGACCACCAAACCTT 

c GSSLKPHVSGNLVHDWWFGI- 
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TACAGCTGAATCCTCAGTTTATGAGCATTGATCTCAAGTTTTTCTTO 

481 + + + + + + 540 

ATGTCGACTTAGGAGTCAAATACTC 

QLNPQFMSIDLKFFFVRAGM- 



TGATGGGATGGCTGCTTATCAATCTCTCTATTCTGGCAA 

541 + + + + + + 600 

ACTACCCTACCG&CGAATAGTTAGAGAGATAAGACC^ 

MGWLLINLSILAKSVQDGSL- 



TGAGTCAGTCGATGATTCTTTACCAGATCTTCTGT 

601 + + + + + + 660 

ACTCAGTCAGCTACTAAGAAATGGTCTAGAAGACACG 

SQSMILYQIFCALYILDYFV- 



TTCATCAAgAATACATGACCTCTACGTGGGACAT^ 

661 + + + -+ + + 720 

AAGTACTTcTTATGTACTGGAGATGCACCCTGTATTAACGTCTCTCTGATcCGAAGTACG 

HEEYMTSTWDI IAERLGFML- 
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TAGTGTTTGGAGATCTCCTGTGGATTC 

721 + + + + + + 780 

ATCACAAACCTCTAGAGG&CACCTAAGGAAAG 

c VFGDLLWIPFTFSI QGWWLL- 



TGCACAACAAAGTAgAACTAACAGTTCCTGCGATTC 

781 + + + + + + 840 

ACGTGTTGTTTCATcTTGATTGTCAAGGACGCTAA 

C HNKVELTVPAIVVNCLVFLI- 



TAGGGTACATGGTTTTTCGAgGAgCT^ 

841 + + + + + + 900 

ATCCGATGTACCAAAAAGCTcCTcGATTGTTT 

c GYMVFR GANKQKHI FKKNPK- 



AAACACC^TATGGGGCAAGCCTCCAGTGGTAGT^ 

901 + + + + + + 960 

TTTGTGGTOATACCCCGTTCGGAGGTCACCA 

c TPIWGKPPVVVGGKLLVSGY- 
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ATTGGGGAATTGCAAGGCACTGTAATTACCTTGGCGACTTGATGCTTGCTC 

961 + + + + + + 1020 

TAACCCCTTAACGTCCCGTGACATTAAT^ 

c WGIARHC NYLGDLMLAIiSF S - 



GTTTGCGATGTGGAATAAGTTCTCCGGTTCCATATTTCTA 

1021 + + + + + + 1080 

CAAACGGTACAC C TT ATTCAAGAGGCCAAGGTATAAAGATGGGCTATATGGAAGACTATG 

c LPCGISSPVPYFYPIYLLIL- 



TATTGATATGGAGAGAACGAAGAGACGAGGTTCGATGTGCAGAGAAGTACAAG 

1081 + + + + + + 1140 

ATAACTATACCTCTCTTGCTTCTCTGCTCCAAGCTACACGTCTCTTC^ 

c LIWRERRDEVRCAEKYKEIW- 



GGGCAGAGTATCTTAGACTTGTCCCCTGGAGAATACTTCCTTATGTTTATTAGATGTGCC 

1141 + + + + + + 1200 

CCCGTCTCATAGAATCTGAACAGGGGACCTCTT 

c AEYLRLVPWRI LPYVY* 
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AAGAGCCAAGTCATGAAtCCTTTCAGATT^ 

1201 + + + + + + 1260 

TTCTCGGTTCAGTACTTaGGAAAGTCTAAGTGGAGAA 



TTGTrTTATTTTAGCAATGCTCGAATTG^ 

1261 + + + + + + 1320 

AACAAAATAAAATCGTTACGAGCTTAACTTTGAA^ 



CAGTCCTTAAAAAAAAAAAAAACCTAAOTTACTCCCN^ 

1321 + + + + + + 1380 

GTCAGGAATTTTTTTTTTTTTTGGATTNAATGA 



TTGTTGTAAAAATTAAANAATTACTNCCTTGANGATCTGTAAAA 

1381 + + + + 1429 

AACAACATTTTOAATTTNTTAATGANGGAACTO 
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SEQ 


ID NO: 3 


•a 
1 


TNTTGAAGGN 


TNAAGAAAAA 


NTANGGTAAG 


51 


TGTNACCACA 


ACACAACAAC 


GCCATGAACC 


101 


AGATCACCTT 


TCTTGAGTTG 


GTGGTTTCTG 


151 


TCAGTGAAGT 


CCAGATGCAG 


CNTCAAAACT 


201 


AGAGTAACAG 


CACCAACCAA 


ATCGCTATCC 


251 


ACNTTCATCG 


TCCTCATATA 


TAATCTGAGG 


301 


TGTCGCTGCC 


AATTCTTTGC 


ATCACAATAC 


351 


TCAGCTCCAG 


AAGTAAACCG 


ATGTACACGG 


401 


GAACGAAAAC 


GAATTCCCTA 


GTCCTAGAGA 


451 


CTATATCTGA 


ATGATGCATC 


ATTGCCGACA 


501 


TCATCAGGTG 


GCTCTAACGC 


AAGAGCTGAA 


551 


CGTGTTTGCC 


ATATCATTTA 


CAGCTCCAGA 


601 


TAGCTGCGTG 


AGTAATCTGA 


AGAACGTCTA 


651 


TCTTTATCTA 


TAATTGGAAG 


ATGTAGAAAC 


701 


CAATGCATCC 


AGAATCGTTG 


TCTCTAGCGA 


751 


TCATTACCTT 


CTCGACAAGA 


GTCAATTCAG 


801 


ACTCGCATCA 


GAATGTCCTT 


TGAAGTCAAG 


851 


CCGTGGAAAT 


GATTACAGAG 


TTAACCCGCA 


901 


GCAACTGAAA 


CAGGATCTGA 


TGGTGCTACA 


951 


TAATAATCGT 


GACAAGGCGG 


GTTAAACATT 


1001 


GAAAGCATAC 


GGTGCAGAAT 


ATCCGCTTCC 


1051 


CCACTGCAGC 


AGCTAAAGCA 


CTACCTTGCT 


1101 


AAATAGCATC 


ATACAAACAC 


TTTGTAATAT 


1151 


CCATTCTCCA 


CAACAGGCAA 


GTGTCTAAAC 


1201 


AAGAGCCTCA 


AGCGCCAACG 


AATCAGAAGT 


1251 


TCATAACCTT 


AGAGACCAAA 


GTTTGATCCG 


1301 


ACTCTTGTAG 


CTACATCTTT 


ATCAGTAACA 
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CTGGGNAGGA CAAGANTTCT 
NATCGGTTTC TTNTGTTTNG 
AGNTCAAGNT CCTTGTTGAC 
TTTGTCCTGT AGACNTAGCA 
GATGTAATCA AAACCTTATC 
CCGTTGTTCC ACATTGTTAT 
CCATCAGCTN TTCGAGGTTT 
CCCTTAAGGT CTTCAAATTT 
TGGGTAAGAA CTGAGCTTCC 
TTTCACTTTG AGTGTCAGAA 
TCCCAAAATT TCTGCATCAT 
ACTGTTCTCC ACCATTGAAA 
CACAAGCTGC AGCTGATCCA 
TTTCCATCAT GCATTGTATG 
TGCACATTCA GGATTCGGTG 
GAGATAAATT TTGTGCCACC 
ATTCCACTGA TTTTGTTCCC 
AATCCCTCAT CCTTTTCGCA 
AGTGCAACCT TCGATGTCTG 
NTNTCCTTCA AGGTTTCAAT 
CCATTGTTTN TCCACACCTT 
CTGCAGTTTC TCCATCCTAG 
CCAACAAAGC AATGACTTCA 
TTCCCTTGAA CCATCTTCTG 
AACAAAAATA GGATTCCTAG 
GTCTCAACCC TTCAGCAATC 
ATCCCGGAAA GAAGCGCACT 
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1351 


T6AATCA6TC 


SEQ 

AACAAACAAG 


ID NO: 3 
CATCAACACG 


CCTAGCAGCC 


ATTNTTCGAC 


1401 


AA6CATC6AA 


AACAGTAGTT 


CCTTCNAGGA 


ATAGTAAGAG 


CTTTCGATAA 


1451 


CCTAAGCTTC 


TTCGCTGTCT 


CTCTCCATTA 


GAAGGAGCTT 


GAGATTGAGG 


1501 


TTGAGGAGGA 


GGTGAATTGG 


GTTTTGAGGT 


GTTCCCANTA 


ACACTTCGAT 


1551 


TCTCTGATTG 


TACTGGTTTC 


TTAGAAGGTG 


GTGGTCCTCN 


CCGTACAGTA 


1601 


GAATTGCTTC 


TCCTCCCTGA 


TGTTGAAGAA 


GGACCCGTCG 


CTTGAGTACT 


1651 


CATATTCGGT 


CAATCTAGGG 


TTTACTTAGA 


TCCTAAATCC 


GTCANAAATG 


1701 


ATTCCTTTAG 


ATATCAAACT 


CGTCTCTGCA 


AATGAAAAAT 


TCAACCTTTA 


1751 


ATTCACAAAC 


TATTGAAATT 


TCATCTAAAG 


CACGAATCTG 


AATAAAACCC 


1801 


AATTCACAAT 


AAAGACGATT 


TGCTCTGAGA 


ATACGATGCA 


ACATACACGA 


1851 


AAAGGATTCG 


AATTTAACGG 


ACGAGGGAAA 


TGAAACAACT 


TGAAACCCTA 


1901 


AGGATTTGAG 


CAGAAGTTAT 


GTGGGAAGAT 


TGGGNATTTA 


GGGTTTACCT 


1951 


TCTTCTTTCT 


TCNTCAAGGT 


CTCTCTCTCG 


AGCACTTTCG 


TTNCCCCAAA 


2001 


AACNAACGGC 


TCTTAACAAT 


TGAGTTAANC 


CANTTATCGA 


GTTTTCATTG 


2051 


GNTGTTCCTG 


TTTCCGCGTG 


TGTGGTGGNT 


CNCCACCTCC 


TTTCTTATAA 


2101 


TCNACGACTA 


AAAATGTTAA 


ANATAANACT 


AANATTTCTT 


TCTANAAAAA 


2151 


TCGTAAAANC 


CAAATGTTTT 


TTTTTTTCTG 


ATAAATGTCT 


ATAAATCACC 


2201 


CTTTCTTTTT 


AAATAATGAA 


ATTTGATGAC 


ATTTATCTCT 


TGTATCTAGN 


2251 


AGAGTTAATG 


GCTAACATAA 


ANACCAAAAA 


AAATTAATTC 


NAATAAATAT 


2301 


GATTTGTGTG 


GGTTACATGG 


AAAAATTGTC 


AAATAATAAA 


NCAAAAAAAA 


2351 


ATTGTATAGA 


TGCAGTGCAA 


GTTGTTTCTG 


GTCAACTTGC 


CGTCGAGCCT 


2401 


CACAACTGTT 


TGTTACAAGT 


GGACTCGCAT 


GTAATTCCCT 


CTTTTAATAA 


2451 


CTTACCAGTT 


ACACCATCCA 


ACATGTGATT 


TGACAGAAAA 


ATATTTTAGT 


2501 


GAAATGTGAT 


CGGTGCAGAT 


TTTTCTATGT 


ACGTTTAAGC 


CTTTAAGGTA 


2551 


GACGTTTAAT 


CCNAAAATAT 


CCCTGAATAA 


CAACACCGAT 


TAATGGAACC 


2601 


AAGTAGATAC 


CTCCTCCGTT 


TGGATGGCTC 


AAATGCAACC 


ATGATGCAAG 


2651 


CTTTTGCGAT 


TGACCCAAAG 


TGAGAGAACT 


AGATCGAGAT 


GGATTATTCG 
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ID NO: 3 
TATAATGGCA 

X Al AAX V9\JV«A 


GCATCTTAAT 


AGTAAACAAA 




AIjwX 1 XAW-.L. 


X X AlarlarX X X X A 


f^r^TTPPTTPA 
VJVw X X W V— X X wA 


CTCTTTGCAT 

X Vv XXX VJwA X 


ACATTGTGAA 


O O ft "1 

2801 


TL 1 vji- Kite XX X 


m7\/^7\m/^/ , ^7A/* , »/ F -» 
X AbAl kjLxA^L^ 


X\ ± AV3 X UVJAAA 




TCAATAACTC 

X WAA X X»**V* * ^» 




Vjt X VjK»Av_ X X Var A 


mn ti 7\ m/^/^trn tv ft 


A A A AHrAMA AT 

AAAAUAiMAAl 


ALA X AVJ X X w 


GAAAACTAGA 

wAAAAV* A * 


O ft ft 1 


WT TV ^T^OfWr^* ^ A ^P 


AX X X vjVj 1 XwA 




wwv X O X X VjAVz 


GTGATATACA 


2951 


ft tv mft tv ft 7v ■» 

CATGAAGACA 


m tv tv ft tv tv ^rtft/■"' , ^"* 
TAACAATCCjt 


ta rp tv CI 71 n A 
A X Al7 w VjAVjrA 


A APT AOT AT m 

AAV X AVJ X A X X 


PATTAJLPPTT 

L>A1 1 AAwU X X 


O ft ft *l 

3001 


ntwiiti< iinnwut tv tv 

TTTCTCTAAA 


ft TV y™* tv nun ft raft ft 

GAGATTGTCC 


rpTV mftTv TvmftmTA 
lAlwiAl UXA 


A ATTTT AfZAT 

AAX X X lAUAl 


fZTTA A A A AAA 

\J X X AAAAAAA 


3051 


AAATggtaag 


gttaaacagg 


ccgctaggtc 


^>m^^^ m 4 s ^% <»a 

ggtt uLdCya 


4* rra t~ /rl" 9 a s s 
uy auy Uaaaa 


3101 


agtagccatc 


ttaaaataac 


agtcgtttgc 


gagacuyycc 




3151 


atgggccata 


ggctcgctca 


agttgTGCTT 


GGCAGAATTT 


AGTAACTTGG 


3201 


GGTTTTGTTA 


TCAACAATCA 


TV fflT TV ft*T*HTf* 7V TV 

ATAb XXX AAb 


ftftmmm^ ftftmft 
<l*L- X X 1ALL X V9 


P A AO A A ATOA 

LaaUAAA X 


3251 


AGAGTTTAAG 


GGTTCTTTTT 


nnmi^ mi lift ft ft ft 

GGTATTCCCG 


tv mm/™i TV ft TV ft TV TV 

ATTLAUALAA 


ftrnft tv ftr»rp^ftft 


3301 
3351 


TCATCAGAGT 
GTCAx^CTGAA 


CCACGAGCTT 

— ELL 
ATTAAACAAA 


^^ft^tTV mil TV TV TV TV 

CC CACTAAAA 

/— f /— i «k TV TV TV TV /"^ 

GC GAGAAAAG 


tv tv mmft tv tv tv TV rn 

AATTGAAAAT 

ft ft ft TV m TV ft TV TV TV 

bLbA x AL AAA 


X V»X X ww X X w X 

r»r» A m TTPf2 A A 
LuaI X X L.WTAA 


O A ft "I 

3401 


TGCTTCATCT 


TCTCCTTTbA 


tv tv tv rnrT*rnrp(^irn 
AAAXLL1 X \~ X 


TPTdPTTAAT 

1 L>1 V7 w X X AA1 


GC TGCT AG AT 


3451 


ATGGATCTCG 


GTGTTCTTCT 


mo ft tv mft tv rrurift 

TCCATCATTb 


ft tv tv mftrnftmft ti 
LAA ILlbi oA 


ftftmftmftmftmm 
VjL* IVtIL.IV-1 X 


3501 


TAGCTTTTGA 


CTGTTGCAAT 


TGTTATTGTG 


AAATTTT x X 


mftftftmj iupMift^ft 
ltbLX X X X Vjvj 


3551 


ATCAGCTTTT 


GTTAAATTCG 


■fun/1 ^% -m /^i *m fnm 

TTC CGAGATT 


TTAbuIT X A X 


ftmftftiriftftmftm 
Var X Vjrl* X Vjv» X V9 X 


3601 


TTTACTTCGT 


mm tl mnm ft n ftikT 

TTACTTGGGN 


ftft»Tmft ft ftftft TV 

C GW X Gv» w»v» A 


ft TV TV TTfTfT 


P rGGGAAAGT 

WVJwwAAAVJ X 


3651 


TATTCGCGGC 


GTC CTTTTAT 


11 ft tv rnftft ftfTift 
\JAV3 A X VjvjL X V— 


mft tv TV ftnTrpftftm 
X L.AAL X X Uv» X 


T APPG ATGCA 

X XV W— wA X wvA 


*"» T ft <V 

3701 


ATGGTATATT 


TGATTTGATT 


m tv ftnTftmftrrift rp 
TAL ILltlLl 


tv ft tv tv mrpftftrpft 
ALAA1 ILL-lvj 


AVJAVJ X w X w X w 


3751 


AGCTCGAAAG 


mnift tv mmm /"^ tv 

TTCATTTCCA 


TTAb XXX IjtVj X 


tft TV T\ f|H|T<^T TV ^ rrrrTi 

XAAX ItAAi X 


TP A GGTPTAT 

X L>AU\7 X W X XT* X 


O O ft "I 

3801 


n%nf^n tv /-*rn tv it m 

TGGCACTAAT 


ATI bll VtVj X A 


ft ft m tv *prpm»v|ftft 


CIA ATPTttTGO 

UAAl W X \J X VJ\» 


AAAACTTGGC 

AAC1AV X X 


3851 


ATTGTATCAC 


CTCTTGTAAG 


TGTAGTTACA 


AGATTTCGAT 


TGTATTTCTA 


3901 


TGAATCCGAA 


TGCTATATGC 


TATATGAATC 


CGATTGCAAT 


TGCTTTCTCA 


3951 


CACTCATTCC 


ACTGAGATGT 


TTGGTAGGTG 


GTTGCGGATA 


GAGGACTTGA 


4001 


GTTACTCTCA 


GCTACTTNNA 


TTTCTTGTGT 


TTGGGGAAGA 


TGATCAATCC 



Fig. 1 5-3 
SOBSTIIUfE SHEET (MILE 2Q 

E3NSDOC1D- <WO 9748793A1JA> 



WO 97/48793 PCT/US97/10644 

24/29 



4051 


TTAGTCCGGN 


SEQ 

GTCTTGGATT 


ID NO: 3 
TTAGNTGNGT 


TACCATCAGA 


TTNGCTTTGG 


4101 


GTGGTGTGAT 


TTGTAATCTC 


CATGATATCT 


CTTAATATTC 


TCAGGTGACA 


4151 


TTAGCATTGT 


ATGTTACTGG 


GCGAAGTTCC 


TCGAATAAGG 


GTTCTTCCCT 


4201 


AAAGCCTCAT 


GTCTCAGGAA 


ATCTTGTACA 


TGACTGGTAC 


TAACATAATA 


4251 


CAATTGTAGA 


TCTGATACTT 


TCTTGTTACA 


CAAAATGTTG 


TTAAAAGTTA 


4301 


TATATTTTGA 


CTCCTGCAAG 


AGCAAAACTA 


AGAAATAATC 


TGGTACTATA 


4351 


TAGAGTTTGA 


AACACTGAAT 


TGGACAAGAT 


GATTCTATAG 


AACTTCGTAG 


4401 


AGTGTTGAGT 


AATTTCTCCT 


AGAACGGTTG 


TAGCTTCCTC 


TTTTTTCCTT 


4451 


TTAACCGCAG 


TGACTTTAGC 


TTTTGGAACT 


TTTCTACTGA 


AACTAGAAGT 


4501 


TCTGGTTTTG 


TCTTTCACTT 


ATCTCTTCCA 


AACAACTGCT 


TCAATTTTTT 


4551 


CTCATATTGT 


TTGTTTCATG 


TGATAGGTGG 


TTTGGAATAC 


AGCTGAATCC 


4601 


TCAGTTTATG 


AGCATTGATC 


TCAAGTAATC 


CATTTTTCTG 


TTTTTTCTTC 


4651 


TATTTGTCAG 


CCAAGGCTAC 


ATCATTGCTT 


CAGTTTGTTC 


CGTACTCAAT 


4701 


CGAGTGGCAG 


TTTAATAATG 


TAATCAGCAG 


TTATGCATGG 


TTATGATGAA 


4751 


TGGGAGTTAT 


TCCTTGTGTA 


GGTTTTTCTT 


TGTCAGAGCC 


GGGATGATGG 


4801 


GATGGCTGCT 


TATCAATCTC 


TCTATTCTGG 


CAAAAAGTGT 


GCAGGATGGT 


4851 


TCCTTGAGTC 


AGTCGATGAT 


CTTTACCAGA 


TCTTCTGTGC 


GGTAAATTTG 


4901 


GTTTTTACTT 


ACAAATCTTG 


CTTCTTGAAN 


TCTGATCATC 


TGTGTTTTGT 


4951 


TAGTTTTGAT 


TAGTTTTATA 


ATTGCAGTTA 


TATATATTGG 


ATACTTTGTT 


5001 


CATGAAGAAT 


ACATGACCTC 


TACGTAAGTT 


CATGGCGTGT 


TAAGGAAACA 


5051 


CATTTGTCTT 


ACCAAAAAAT 


GACCATTTGC 


ATTATTACAT 


CTACTTTGAT 


5101 


TTTACTCTTT 


TCAGGTGGGA 


CATAATTGCA 


GAGAGACTAG 


GCTTCATGCT 


5151 


AGTGTTTGGA 


GATCTCCTGT 


GGATTCCTTT 


CACTTTTAGC 


ATTCAGGCAT 


5201 


GTAACTGTGA 


GCCTGAACAC 


AAACAAGATA 


TTAATTTATC 


TTATTGACAG 


5251 


TATCTTCTTG 


GCATGTTACA 


GTTATTCTCG 


GAAACAATAT 


TGTTCTAGAA 


5301 


TGCTTGATCA 


CTCTGTGACT 


GAATTGTCTT 


CTCTCTGGTA 


CAGGGCTGGT 


5351 


GGCTTTTGCA 


CAACAAAGTA 


GAACTAACAA 


TTCCTGCGAT 


TGTAGTCAAT 
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SEQ ID NO: 3 



5401 


TGCCTTGTCT 


TCTTGATAGG 


GTAAGTTCTG 


AGACATGGGG 


TTATTTTCCA 


5451 


TTCTTACATA 


TCTACACTAA 


GAAACCCACT 


ATTTCTTCTT 


TGGCAGGTAC 


5501 


ATGGTTTTTC 


GAGGAGCTAA 


CAAACAAAAA 


CATATCTTTA 


AGAAGAACCC 


5551 


AAAAACACCA 


ATATGGGGCA 


AGCCTCCAGT 


GGTAGTTGGT 


GGAAAGTTAC 


5601 


TGGTTTCAGG 


CTATTGGTAT 


GTTATATTTA 


TCTTCTCTTG 


TTTCTTTGCT 


5651 


TGGTTTCGCC 


ATCTCTGTGT 


TTGATTGTTC 


ATCATGCTGG 


GAATAAAGAG 


5701 


TTGAAAGTTC 


CGCAATGACA 


CATTTCCGAT 


AACTTAGGTG 


CTGTTTTGTA 


5751 


TATATGACAG 


GGGAATTGCA 


AGGCACTGTA 


ATTACCTTGG 


CGACTTGATG 


5801 


CTTGCTCTGT 


CCTTCAGTTT 


GCCATGTGGA 


ATAAGGTACT 


CCTNCTGCTT 


5851 


GAGTTCACTT 


ACAGCTACCA 


AAATCATGTA 


GAAACTAATA 


CCAATATCNA 


5901 


AACGTTCGAA 


GTTGATTTGG 


CTGACTTAAA 


GATATTGATC 


TCTAACCATC 


5951 


ATTTGAAAAG 


TCTAAAGCTT 


TCAAGTTCAT 


TTCCCAAAGC 


TGTTTTTATG 


6001 


ATATTTCGTC 


TNGTGTATTC 


TCAGTTCTCC 


GGTTCCATAT 


TTCTACCCGA 


6051 


TATACCTGCT 


GATACTATTG 


ATATGGAGAG 


AACGAAGAGA 


CGAAGTTCGA 


6101 


TGTGCAGAGA 


AGTACNAGGA 


GATATGGGCA 


GAGTATCTTA 


GACTTGTCCC 


6151 


CTGGAGAATA 


CTTCCTTATG 


TTTATTAGAT 


GTGCCAAGAG 


CCAATTCATG 


6201 


AATCCTTTCA 


GATTCATCCT 


CTTGTGTCTT 


ATTTTTTCAT 


TAAATGTGAC 


6251 


NTGAAATGAT 


CCCATTATNG CCTNTTATCA 


ATGCTTGATT 


GAAACTTTGT 


6301 


AGTACACGTT 


ELL 1 

TGAGAATTAC TTCAGTCCTT 


GTTATTATTT 


TAGCATGGAT 


6351 


ATCAACATTT 


TCGGATTTAT 


TTNTNGGGTT 


ATTTTAAAAC 


CNNAGATTAC 


6401 


CNAANAAAAC 


CATTGTTTGA 


NGTANGATAA 


TATGGACTTT 


TTACTGAAAA 


6451 


AAAATNCTAN 


TAGGGGAACA 


AATNGAAGTT 


GAATATGGCT 


GAATNTTTTT 


6501 


ATGGANAAAA 


TGGAAACTTT 


TCCCACTTTG 


AAATGACAAT 


NCAAGTTTGG 


6551 


TGGACNACTT 


AATCACTGGA 


AACGTTAATG 


GCCAACCN 
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* « « « • 



1 

3356 


C TGAAATTAAAC AAAGCGAGAAAAGGCGATAC AAAC GATTTAGAATGCTT 

IMIINIIIINIIMIINIIIIIililMMMMIIIIIIIMII! 

CTGAAATTAAACAAAGCGAGAAAAGGCGATACAAACGATTTCGAATGCTT 


50 

3405 


SEQ 
SEQ 


ID 
ID 


NO: 
NO: 


11 

10 


51 
3406 


• • • • • 
CATCTTCTCCTTTGAAAATCCTTCTTCTGCTTAATGCTGCTAGATATGGA 

llll!lllll!l!!llllllllll!llllllilll!l!llilMIIIIII 

CATCTTCTCCTTTGAAAATCCTTCTTCTGCTTAATGCTGCTAGATATGGA 


100 
3455 










101 
3456 


• « * • • 

IIMIMIIMIIIIIM IIIIIMM ! IN 

TCTCGGTGTTCTTCTTCCATCATTGCAATCTGTGAGCTGTCTCrTTAGCT 

• 
• 


131 
3505 










132 
3556 


• 

• • « • • 

lllllllllllllllllllll 
CTTTTGTTAAATTCGTTCCGAGATTTTAGGTTTATGTGCTGGTGTTTTAC 


152 
3605 


SEQ 


ID 


NO: 


12 


153 
3606 


• • • • • 

TTCGTTTACTT . GGCCGTTGCCGGAGAAATTCTCCCCGGGAAAGTTATTC 

lll!llll!irillHhl!ll!!!lllil!ll!l!!l!!!ll)IIMI! 

TTCGTTTACTTGGGNCGNTGGCGGAGAAATTCTCCCCGGGAAAGTTATTC 


201 
3655 










202 
3656 


• • • • • 
GCGGCGTCCTTTTATCAGATGGCTCTCAACTTCGTTACCGATGCAAT . . . 

IIIIIMIMUIillMMIililMiiUIIIMrilllilllM 

GCGGCGTCCTTTTATCAGATGGCTCTCAACTTCGTTACCGATGCAATGGT 

• 
• 


248 
3705 










249 


• 

• • • • * 
GGTCTATTGGCA 

MlilllMIII 

GAAAGTTCATTTCCATTAGTTTGGTTAATTCAATTTCAGGTCTATTGGCA 


260 


SEQ 


ID 


NO: 


13 


3756 


3805 










261 
3806 


• • • ■ • 
CTAATATTGTTGGTAGCTATTTTGGGAATCTGTGCAAAACTTGGCATTGT 

tllMllllllllllllilllhMllllllllllllMlllllllllll 

CTAATATTGTTGGTAGCTATTTNGGGAATCTGTGCAAAACTTGGCATTGT 


310 
3855 










311 


• » • • • 
llllllllll 

ATCACCTCTTGTAAGTGTAGTTACAAGATTTCGATTGTAOTTCTATGAAT 

• 
• 


320 










3856 


3905 










321 
3956 


• 

• • • • « 

lllllllllllllllllllllilllill 

ATTCCAC TGAGATGTTTGGTAGGTGGTTGC GGATAGAGGACTTGAGTTAC 


348 
4005 


SEQ 


ID 


NO: 


14 



• • • • • 
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349 TCTCAGCTACTTTTATTTTCTGTGTTT 375 

Mill llllllh'llll M! MM 

4006 TCTCAGCTACTTNNATTTCTTGTGTTTGGGGAAGATGATCAATCCTTAGT 4055 



376 


TGGTGACATTAgC 


388 SEQ ID NO: 15 




IIIIIIIIMII 




4106 


GTGATTTGTAATCTCCATGATATCTCTTAATATTCTCAGGTGACATTAGC 


4155 


389 


» • • • • 
ATTGTATGTTACTGGGCGAAGT t CCTCgAATAAggG t Tc t TC CCTAAAgC 


438 




tlllllllllllltll.il II II lllllllil II II III II Hill III II 




4156 


ATTGTATGTTACTGGGCGAAGTTCCTCGAATAAGGGTTCTTCCCTAAAGC 


4205 


439 


• • • • • 


468 




M 1 1 M M 1 Mill MUM II Miillll 




4206 


CTCATGTCTCAGGAAATCTTGTACATGACTGGTACTAACATAATACAATT 


4255 



469 ♦ GGTGGTTTGGAATACAGCTGAATCCTCAGT 498 SEQ ID NO: 16 

IliM MlillMUIUi ill IIUIIII 

4556 ATTGTTTGTTTCATGTGATAGGTGGTTTGGAATACAGCTGAATCCTCAGT 4605 



499 TTATGAGCATTGATCTCAA 517 

( I i !! II I I I I II II II I I 
4606 TTATGAGCATTGATCTCAAGTAATCCATTTTTCTGTTTTTTCTTCTATTT 4655 



518 GTTTTTCTTTGTCAGAGCCGGGATGATGGGATGG 551 SEQ ID NO: 17 

1 1 1 1 M M I M 1 1 M 1 1 M I M ! 1 1 ! M I M M I 

4756 GTTATTCCTTGTGTAGGTTTTTCTTTGTCAGAGCCGGGATGATGGGATGG 4805 



552 CTGCTTATCAATCTCTCTATTCTGGCAAAAAGTGTGCAGGATGGTTCCTT 601 

nniiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

4806 CTGCTTATCAATCTCTCTATTCTGGCAAAAAGTGTGCAGGATGGTTCCTT 4855 



602 GAGTCAGTCGATGATTCTTTACCAGATCTTCTGTGC 637 

I M M 1 1 M IMIMI 1 1 1 II MM Mill 1 1 II 1 1 

4856 GAGTCAGTCGATGA . TCTTTACCAGATCTTCTGTGCGGTAAATTTGGTTT 4904 



638 GTTATATATATTGGACTACTTTGTTCAT 665 SEQ ID NO: 18 

MMIIMMIIMM IMMMMMI 

4955 TTTGATTAGTTTTATAATTGC AGTTATATATATTGGA . TACTTTGTTC AT 5003 
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666 GAAgAATACATGACCTCTAC 685 

IIIIIIIIIIIIIMIIIII 
5004 GAAGAATACATGACCTCTACGTAAGTTCATGGCGTGTTAAGGAAACACAT 5053 



686 GTGGGACATAATTGCAGAGAGACTAgGCTTCATGCTAGT 724 SEQ ID NO: 19 

liillliilliillill! I MIMMINIi i i Miltl 

5104 ACTCTTTTCAGGTGGGACATAATTGCAGAGAGACTAGGCTTCATGCTAGT 5153 

• * • • ■ 
725 GTTTGGAGATCTCCTGTGGATTCCTTTCACTTTTAGCATT 764 

llllllllllllllllllllllllllllllllllllllll 

5154 GTTTGGAGATCTCCTGTGGATTCCTTTCACTTTTAGCATTCAGGCATGTA 5203 



765 CAGGGCTGGTGGC 777 SEQ ID NO: 20 

Mlllllllllll 

5304 TTGATCACTCTGTGACTGAATTGTCTTCTCTCTGGTACAGGGCTGGTGGC 5353 



778 TTTTGCACAACAAAGTAgAACTAACAGTTCCTGCGATTGTAGTCAATTGC 827 

IIIllllMlillllllllllllill MIIMIMIIIIIIIIIIIMI 

5354 TTTTGCACAACAAAGTAGAACTAACAATTCCTGCGATTGTAGTCAATTGC 5403 



828 CTTGTCTTCTTGATAG 843 

111)111 1 1 MIMM 

5404 CTTGTCTTCTTGATAGGGTAAGTTCTGAGACATGGGGTTATTTTCCATTC 5453 



844 GGTACATG 851 SEQ ID NO: 21 

lillllll 

5454 TTACATATCTACACTAAGAAACCCACTGTTTCTTCTTTGGCAGGTACATG 5503 



852 GTTTTTCGAgGAgCTAACAAACAAAAACATATCTTTAAGAAGAACCCAAA 901 

llllilllMllllllllillllllllllMllllllllIlllliMIII 

5504 GTTTTTCGAGGAGCTAACAAACAAAAACATATCTTTAAGAAGAACCCAAA 5553 



902 AACAC CAAT ATGGGG C AAGC CTC CAGTGGT AGTTGG TGGAAAGTTAC T GG 951 

1 1 1 1 1 1 1 S 1 1 i i 1 1 1 f 1 1 1 i 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 2 1 

5554 AACACCAATATGGGGCAAGCCTCCAGTGGTACTTGGTGGAAAGTTACTGG 5603 



952 TTTCAGGCTATT 963 

llllllillill 

5604 TTTCAGGCTATTGGTATGTTATATTTATCTTCTCTTGTTTCTTTGCTT GG 5653 



964 



5754 



GGGGAATTGCAAGGCAC TGTAATTACCTTGGCGACTTGATGCTT 1007 SEQ ID NO: 22 

III IIMIIIIIII IMIIIIItllllllllMllllllil II I 

ATGAC AGGGGAATTGCAAGGCACTGTAATTAC CTTGGCGAC TTGATGCTT 5803 
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1008 GCTCTGTCCTTCAGTTTGCCATGTGGAATA 1037 

IM!I MIMIIIil! IIIIIMMNIM 

5804 GCTCTGTCCTTCAGTTTGCCATGTGGAATAAGGTACTCCTNCTGCTTGAG 5853 



1038 AGTTCTCCGGTTCC ATATTTCTAC CCGAT AT 1068 SEQ ID NO: 23 

MMIUIIMIilMIMIIilllllilll 

6004 TTTCGTCTNGTGTATTCTCAGTTCTCCGGTTCCATATTTCTACCCGATAT 6053 



1069 ACCTTCTGATACTATTGATATGGAGAGAACGAAGAGACGAGGTTCGATGT 1118 

lill NIIMIIIMIIIIINIIMMIIIIIIIIMI MIIIMI! 

6054 ACCTGCTGATACTATTGATATGGAGAGAACGAAGAGACGAAGTTCGATGT 6103 



1119 GCAGAGAAGTACAAGGAGATATGGGCAGAGTATCTTAGACTTGTCCCCTG 1168 

I illll I M MhillMMI! MIIMMIMI 1 1 MiMIIMIMM 

6104 GCAGAGAAGTACNAGGAGATATGGGCAGAGTATCTTAGACTTGTCCCCTG 6153 



1169 GAGAATACTTCCTTATGTTTATTAGATGTGCCAAGAGCCAAGTCATGAAt 1218 

1 1 1 1 J 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 f I Mllllll 

6154 GAGAATACTTCCTTATGTTTATTAGATGTGCCAAGAGCCAATTCATGAAT 6203 



1219 CCTTTCAGATTCACCTCTTGTTGTCTTATTTTTTCCATAA 1258 

IIIIIIIIIMII I ! 1 1 1 1 1 1 1 1 1 1 1 li I III 

6204 CCTTTCAGATTCATCCTCTTGTGTCTTATTTTTTCATTAAATGTGACNTG 6253 



1259 TCTTGTTTTATTTTAGCAATGCTCGAATTGAAACTTTGTAG 1299 SEQ ID NO: 24 

II h hill Nlllll I lllllllllllllll 

6254 AAATGATCCCATTATNGCCTNTTATCAATGCTTG • ATTGAAACTTTGTAG 6302 



1300 TACACTTTTGAAAAATAACTTCAGTCCTT 1328 

Illll (III I III llllllllllll 
6303 TACACGTTTG . AGAATTACTTCAGTCCTT 6330 



Fig. 16-4 
SUBSTITUTE SfffT (ffiflf 2B) 



BNSDOCID <WO_9748793A1JA> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/ 10644 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :C12N 9/02, 15/53, 15/63, 1/21, 15/09, 15/10; C12Q 1/68, 1/26 
US CL :526/23.2. 23.6; 435/189, 252.3.320.1 , 419, 6, 91.2 
According to International Patent Classification (IPC) or to bo th national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 526723.2, 23.6; 435/189, 252.3.320.1, 419, 6, 91.2 

_ 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 

APS, MEDLINE, SCISEARCH, LIFESCI, B10TECHDS, BIOSIS, EMBASE, CAPIUS, NTIS, WPl, CABA, AGRICOLA 
search terms: sterol reductase* 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 
Y 

A 


TATON M. Microsomal A 8 14 -Steroi A 14 -Reductase in Higher 
Plants. Eur, J. Biochem. November 1989. Vol. 185. pages 
605-614. see entire document. 

GOAD L.J. Application of Sterol Synthesis Inhibitors to 
Investigate the Sterol Requirements of Protozoa and Plants. 
Biochem Soc. Trans. 1990. Vol. 18 pages 63-65. see entire 
document. 

LAI, M.H. The Identification of a Gene family in the 
Saccharomyces cerevisiae Ergosterol Biosynthesis Pathway. 
Gene 1994. Vol. 140. pages 41-49. 


1-18, 24, 25 
1-18, 24, 25 

1-18, 24, 25 


| ~] Further documents arc listed in the continuation of Box C. | | Sec patent family annex. 


• Special categoric* of cited documents: "T" laler document published after the international Tiling date, or priority 

dale oik) not in conflict with the application but cited to undcmland the 
*A* document defining the general state of the art which in not c« inside red principle or theory underlying the invention 
to be of particular relevance 

.... . ... . . , . . "X" document of pnrticular relevance; the claimed invention cannot be 

b earUr docuineni publwhed on or after the mtenumoiwl fiJmg date considered novel or cannot be considered to involve an inventive**? 

"L" document which moy throw doubt* on priority claim! s) or which in whcn Ulc document * alo « c 

cited to establish the publication date of another citation or other , ...... 

special reason (as specified) " Y " document of particular relevance; the claimed invention cannot be 

considered to involve an inventive step when the document is 

'O" document referring to an oral disclosure, use. exhibition or other combined with one or more other such documents, such combination 
means being obvious to a person skilled in the art 

* P* document published prior to the international filing dale but bier than • document member of the same patent family 

the priority dote claimed 


Date of the acuta/ completion of the international search 
20 AUGUST 1997 


Date of mailing of the international search report 

0 4 SEP 1997 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington. D C. 20231 
Facsimile No. (103) 305-3230 


REBECCA PRwfV ^ ' / 
Telephone No. (703) 308-0(96 



Form PCT/ISA/210 (second shcel)( July 1992)* 

BNSOOCID". <WO__9748793AlJA> 




INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/US97/10644 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. I I Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Nos.: 

' — * because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. I I Claims Nos.: 

" ^ because they are dependent claims and are nut dratted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1 . [ [ As all required additional search fees were timely paid hy the applicant, this international search report covers all searchable 

claims. 

2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. | | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. | x[ No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-18, 24 and 25 



Remark on Protest 



1 The additional search lees were accompanied hy the applicant's protest. 
| No protest accompanied the payment of additional search fees. 



Form PCT/1SA/210 (continuation of first sheet ( I ))(July 1992)' 



BNSDOCID <WO 9748793A1_IA> 



INTERNATIONAL SEARCH REPORT 




International application No. 
PCT/US97/ 10644 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1 . In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group 1, claims 1-18, 24 and 25, drawn to C-14 sterolreduciase, DNA therefore, and methods of producing C-14 sterol 
reductase. 

Group II, claim 19, drawn to C-14 sterol reductase antibodies. 
Group III, claims 20-23, 30 and 31, drawn to transgenic plants. 

Group IV, claims 26-29, drawn to a method for reducing C-14 sterol reductase expression. 

The inventions listed as Groups MV do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: The 
inventions of Groups i-III comprise different products with unrelated chemical structures. The method of group IV 
lacks a corresponding special technical feature because it neither makes nor uses the products of any of Groups I-III - 



Form PCT/ISA/2I0 (extra sheet)(July 1992)* 



BNSDOCIO: <WO 9748793A1JA> 



-WO 



2/29 



PCT/US97yj0644 







i 




BNSDOCID: <WO 9748793A1TI_> 



WO 



4/29 





PIGUM 4 




II 






BNSDOCID: <WO_9748793AlTI_> 




BNSDOCID <WO 9748793A1TI_> 




BNSDOCID- <WO 9748793A1Tt_> 



wo 9?mm 



PCT/US97/HXS44 



7/29 



J t CURE. ?A TT* ' ■ - . 1 







1 1 * 

• i • ■ " . . • • sj.-v 




7: ■*>]«»! 



#89 





•"TV 



rt«5'V- , *" :v " r, < " 



-W- — • - ■ mmmm 




v- 



-A 
... A 



v*4 



FIGURE ?C 




BNSDOCtD <WO 9748793A1TI_> 



. WO 97rt8W 



PCMJS9?/i0644 




BNSDOCID: <WO 9748793A1T|_> 




BNSDOCID <WO 9748793A1TI_> 



WO 97/4*793 



# 



PCmfS97/10644 



10/29 




BNSDOCID: <WO_9748793AlTl_> 



WO 97*48793 



11/29 



FCT/UB»7nOfi44 



chrm3 




m424 



BNSDOCID <WO__9748793A1TI_> 



12/29 



•A 5 "■ 



Sc 7 ► ,\ 
£ 3 X J X 



# 



PC1VUB97/H1M4 



- ■ i V i A 

• ... 

■» ' i »• 1 




ACS 
3TS1 





a a st v ; l ? 



V 3 



• il L j . Aj u j j r 

- - .« - FX?.?; 
3\ « - A ^ t c- jf H .H 



r ~ >■*. if 



a 



v qB s n c i 



CUR E L T V 7 C 



a 
? 



on 

-GE3S 



f ? V 7| 

v - *L 




I . M % 
!. I 1 < 



• c 



1 



s kQ s 

CH F-J 



R 

K 
K. 




» 1 1 

•I • > — r 

? » . s a c 

A3. . . VI 3 f V ; 

h H . . . M TllV 7 



H:0 lv^n 

B a -^ii 



* *fl 7 I? 

5 vv <? ? & ; 

_ ^ A'Nn 

•n it- s»- 1 1 . 1 ■ • 



1 . ■ 



v v a 

3 j l • ■ : ; y : v 

5 » • i ■- 55 2 7 t v 



: l : a' 




• — 1 



A: ft 
4L.il 




V LJ T PC 

V 1; ? f '/ 
A?3 I I 



I. V V 



1 c 
A s 

rB n. .T. 



— -r f 

•m r x 1 



3 J » It 3 3 2S . t ? « V 

:> 5 v. > -:• y. iff; :_ 3 := » 

i « - - H ' j - -- ■ ^- 

• ■ ;;s:pu? :-: r 




■ u J 

> : 
t - * 




i I i A R 
# C L S H 

5 A A v a 



Pore & 




0 3 H X 

& A L V 

& 3 H V 

P <? V L F 

P V L C '/ 




aHH; ^Wy 7 v 
cTB: rQJo v l 
7 *P vi|Jsv: 



H 

A N A C £ K. C 
A N A C A K H 






f 

.- 


c: 1. 


V 




r. !■■ 


K 1 


A 






ri:. ( 


□ 






K L 






- P' 


k r. 







z 

l y 7 v : 



2e • 

V.J 



Aih 




r ? ^ *. J M - ; 
p » r ^ 9 Y 1M 




1 ll 

75-:-?:^ 



qBk p Ht 

RCA HmC 



r. i.'l, 



7|B 2 




1>; 
j : j 




FIGURE 11 



BNSDOCID: <WO 9748793A1TI_> ' 



13/29 



:jv; 5 ** n 




wawiw H^flo BSD 
HE S B B R.BIBMflSE 



V 



Li" If K v 

i. v f r: *u 



X 



: -B * £3 £ - □ - 




ASM 




J «9 * - v 3 





3 50i 

^ tr 



i 




a 




tMflin qnim^ f|3 



C ft 



?Hh Has 

I E| " Q H ^ 



a 3 1 21 £ A ?t 




7 Gl 



SHI 



* y u 
ft m n 



AcA . . i . ■ 
rtdlkbJl i V 3 i* - 




[ D Si 



QBE* EI - - cro.n=- 



Ac* 



p H v H - iQ3' 




§ 



Hi 




Ach 




11 





chick«n 0 Lin "UNI • ypMMV (Da a| 



■j i j. 
ii f A 




L, V T L A 

g « a. EH ; 




v 
C 



At* sflit IBm v . 
Human Us El A £ . 




r a 




5 S M K g s i:"E 

5 M 5= i 



BfHr . vIBMBk 



• . ■ • • ■ 
_ .■ -i ^ ■ 

• t 17V V 



3" 
A 
V 



fliJtlilwmg r ^ vQBIaiumsm BBDSIIi 



v *. r n 



AtJi H 




l " :ri JSJli; ti J2L5 h v ^ lp 



Jti4Tid.ll 



V I . V 

v r, ^ 




mi 



Ii I V V (I C If v 



» RflvgA Ail-: in H£B 




•> - j: 



H.Lh 
Hma 



It 
A 
£ 




b hQk £ i. l; ii 

i? M nn l >.i ri 

h !■ « v ri c, n r. 



■ r • 

- r t 



3 r 




7 S -5 p > mzi f 



L 3T f 3f 
K IJ .I t .t 1 1 V I" V 
FPU "LTV TV 




1. Li "J II II 
5. L V IT B 




V ft 

H 



K C t. b. 
Il f If H- 



Ath 




O 11 !■ V 



4- n-iHir?L 
HE : B 



Ath 



3t3 

7 M 5 '■' - 1' wC7 V H 417 
dlS 



PICtTRE 12 



BNSDOCID: <WO 9748793A1TI_> 



PCT/IWT/1W44 



14/29 





BNSOOCIO: <WO 9748793AlTI_> 



WO 97/48793 PCT/US97/10644 

w 



15/29 
PICWftS / SHEET 1 



r 



p 






M 




5 




NT 


b 


B 




s 


sa 


n 






If 




I 




I 


I 


/ 


I 


I 



CACTCTAATTTCTTTCGCTCTTTTCC SCTATGTTTGCTAAAlOT 









¥ 














K 














H 




B 


fc. 


B 


M 


Tu 




fli'jD 




hA 


5 






b 


& 


bA 


£ 


at Ip 


a 


o4 


c 


r 


I 


V 


e 


eK 






r 




c 


E> 


3 


I 


r 


11 


1 


irn 


r 


II 


I 


r 



TTTCAAAATCCTTCTTCTCCVTJ^TCCrc 

51 . --r + + -4- ------- + + 22Ci 

AAACTTTTAOT AAG AAG\ACG AATTACC AC C AT CTAT ACCT AG AGT C ACAAG AA G AAG OTA 





H L L S "M 


T> L G V 


L L 








K 






c 


& 


GCft 




w 




c 




H 


AEt i 


i 




aiil 


B 






£ 


«3JI 


P 


ol4 


J 


I 


JITI 


I 


lit 













CAriX3CAATCTCTTTATaTCCTGCTGT*r^ 
i2i + + «. 13^ 

GT>ACGTTWAC 1 ^AATACACGACCACAAJ^TGAAGCAAATGAACCGGCAACGCCCTCTT f T 





Q s v y V 


L V T Y 


T V V L A v 


A 




T 












3 




F 






M 


P 


£ E SS 


n S 


c 




•% 


5 




AuTsT 


B v 


f 






avasc-crrr&o 




c i 




I 




JaJpi iFFaa 




c J 


N 


T 
-i 


r 


inirxiiir 


rrui 


I I 


I 


I 




tfitiff 


/ / 









BNSDOCID: <WO 9748793A1TI_> 



WO 97tW99 PCT/US97/( 0$44 

FIGURE 14 / SHEET 2 



181 * . * 240 





L P 0 K V r 






ft X R 


C B 


C 


H 


c 


v * 


S Av 


iT 


V 


i r 


a ii 


n£ 


i 


R D 


p 


ei 


R 


I 1 


I 21 




I 




/ 







GATGC AATCCTCTATTMe ACTAATATTCTT^ 

* 1- * * 306 



C N 


G L L A L 


I 


L L 


v a r 


r 


G t 


C 


A X 


L- 




H 








w 








B 














ti 




C 


c 


CH 


\ 




AM 


c 




*i 


D 




e 




E 




cn 




I 


pri 


CS 


11 


& 




2 


1 


iL 


e 


I 


14 


e 


uJ 




22 


I 


l 


IS 


I 


I 


II 


I 


11 


I 








/ 










/ 





TTGGCATTCTAwTCACCTCTTOT 

AAC OCTAACATACTGGAGAAC^CCAACGCCTAT^ 

G I V -~ PLVVADRGLELLSATr- 

• e « M 

«p H « B S T b 

24 p 2 £ b a, o 

IS Vi I i r q X 

21 till r 1 

/ 

3€"1 — --#■ --------- ------ ^ ■* » * • + 

AATAAAAOACACAAAACCACTCTTAAT^^ 



t 


F C V L V 


T L 


A L V 


V T G 


ft 








H 


A 










C 


1 


I 


B 


1 




M 


V 


A 


P WW 


% ?T 


A 


& 


n 


* 

l 


I 


d n2 


r 


r 


E 


1 


J 


I 


e 16 


C at 


2 


r 


I 


r 


r 


3 II 


r n 


2 


I 



/ 

AggGt TctTCCCTAAAgC CTCATCTCTCAgCAAATCTTCTAC ATCACTCCTGCJTTtcGAA 

421 * ■ * «- 460 

TccC4A^aACCCAT*TTcOGAGTACAGAGTcCTTTAGAACAT5TACTCACCACC AAACCTT* 



5 S 






K V 


s 


C H X V K 


D W W r G 


H 




s 






s 




CsKP 








K 


a 


c s 


AvpivT 






M 


X 


u P 


vKNcB 




d 


6 


r* 


r* 


3 P 










1 


4 


A n 




221121 




I 




T 
* 


I I 


I222I 



BNSOOCID. <WO 9748793A1T(_> 



1 7/29 

FIGUJtE 14 / SHE ST 3 



// // y 

TAC AGTTCAATCCTC WTTTATC WC ATTC^TC^ tGGGA 

461 * — — * -r ^ 54& 

ATCrCC^CTTAOTAGTCAAATACTCGTAACTAfiAOT 

M - 



L 


N 


P 0 


F H 5 


I DLKFFFV 


R A C 






F 












Cn 




C 


DN 


B 


B 


VUFT 


r 


V" 


» rl 


c 


c 






i 


c da 










R 


<r II 


I 


1 


mr 


I 




I IV 






// 









5-4 L ' — -4- + * * €00 

ACTACCCTACCCfcmAATAOTTAjG^ 

ILAKSVODCSL- 





<3 




L 


I N 


L S 










B 


S 


K 






K 


cbf 


EB A 


iF 


B 


TP 


iT 


ef> 




no 




aL 


h£ 


ao 




£k 




q* 


fi 


?i 


IYnA 


II 


I 


II 


rj 


ii 


IIII 








/ 


/ 


/ / 



TOACTCAGTCCATGATTrTTTAGCAJ^TCT^ 

*Ql - — - — — - — * — ^ 6*0 

ACTC A&T C AjG-£"!T ACT AACAAATG CTC T>G AACA C AC Q£AJ\ TATA TAT hAC CTGA TC- AAAC 



5 


Q S « 


I I# Y 


9 I 


F 


C 


A L 


Y I 


L 


D 


Y 


F V 






















T 














T 










t 
















A 








h 






1 


1H 


B 




P 


1C 


fi 




c 


l 


1 




a 


ab 


BBT 


M 




w 




B 


V 


1 






I 


ie> 




a 


0 


2i 


A1 


e 


i 


1 


It 


A 


I 


il 


All 


1 


& 


&* 


F 






I 


3* 


I 


I 


II 


III 


I 


I 


II 


I 


r 


X 


I 


II 



/ 

TrrATC^flAATACATCACCTCTACCTGG^ 

661 - + -r.,---^- 720 

AACTACTTcTTATCTACTTCAGAraCACCCTC^^ 

WEEYKTSTVfDI 

C BS * H C C C 

H } g*&u iT j & W v v 

p a itp3 rif & = vi i i 

L 9 SYnA H p m o J J 

I I 11TI XT XI 1 I I 

/ / / / 
TACTCTTTCGACATCTOCTGTCC^ 
"2* -•-■■-1 — + « «. i ■ 4--- ■»- +■ 7£D 

ATCACAAACCTCTAGAGGJiCACCTAAC^3^AAC 

VFGDLLWI PFTFSIQCVvJLIj- 



BNSDOCID- <WO 9748793A1TL> 



16/29 

FICimK W i SHEET * 



1 s 
C v A. p H 

v P 1 MS fib 

i <1 w ud bo 

R c w n9 eJ 

I II II It 

/ / 

TOC AC AACAAAGTAg AACTAAC ACTTCCTCCG ATTCTAGTC AATTCC CTTGTCTTCTTG A 

7S1 * —- h # filD 

AeGTCTTTGT^CATeTTC^TTG^ 

i V F L I - 







E L 


T V S> A 


I V 


V M C 












T 












t 




N 








h 




L 




C 


a 


1 


R 


AM 


T 


Ay 






a 


rn 


a 


Li 


e 






5i 




uJ 


R 


le 


I 


II 


I 


I! 


I 


II 








/ 







TAGOCTACAirxnTTTITCGAgGAgCT 
£41 * ---<+ -.«-' — +. 

K - 



o 




V 


F 


R 


G 




N K. 9 X M 


I F 


K 


K U 


P 


















M 








K 








C 


c 




C T 


ft 






C 


b 


BM 


B 


C 


Q 




a 


1H* 


* 


C 


6- 












c 


i 


B 


enp 


t 


* 

3 




i 




ml 


1 




B 


J 


r 


P1R 


r 




r 


J 


I 


II 


r 


I 


I 


I 


I 


«T 


5 


I 


I 


I 
















// 











AAACAC CAATATOCXCCAAt^CTCCAOTGGTAGTTO^^ 
901 4 .«.. ^ .4- * 960 

. v s c y - 





P 2 . 


W G 


fc P 


P 


V V 




C L 














T 






T 






T 




t 






9 




T 


b 






E 


c 




c 




PT 


Q 3 


I 


c HC 


* 


5 


V 


P 


*s 


aSt 


a 


q ia 


* 


0 


i 


4 


Op 


it A 


i 


5 nc 


p 


9 


ft 


C 


$K 




I 


7 49 


I 


I 


I 


I 


tl 


III 




r il 



/ 

ATTCCGOAATTSCAACCCACTCTAATTACCTTO^ 
561 - •----------»--------'-■-'♦ * + 4 1020 

TAACCC C-TTAACCm:CCCTG AC ATT AAT^ 

WGIARKCNYLCSLHLALSrS- 







• 




1 






N 


a 




sM 


1 


I 




as 


a 


I 


n 


Wp 


I 


I 


X 


II 


V 



BNSDOCID: <WO 9748793A1TI_> 



FIGURE 14 / SHEET 5 



1021 + + * xOgo 

CAAAeGCrTACAtCTrATrCAAJC^GGCC;^^ 

L - 



I s 


S 


P V p v 


7 V P 


1 Y L- lr 


I 


A 












LB 




M 


C 






5 ws 


H 


bT 


V 


P.T 


B 




Ta 


e& 


i 


*a 




r 6B 


1 


tq 


R 


At 




I II 


I 


12 


I 


tr 


I 


/ 








/ 





— * * — * — -«+ -i- iUG 

ATAACTATACCTCTCTTGCTTCTCTGCTrCAAO 

E 

B EcS 

S D SDC E 

in d aRr p 

t I IIS r 

/ 

OTQCACAOTATCTTMACTTGTCCCCTC;*^ 

L141 — ^ * 12Q& 

CCCGTCtXTATMAATClTUiUCAiKGGACC 

Y V Y ■ 







R L 


V 


P w * 


I h P Y 






M 








c 




HI 




K 




V 


K 


LaT ■ a 


H 


IT 


M 


i 


c 


filf re 




nC 


n 


J 


& 




H 


Ei 


1 


I 


I 


131 I 


3 


II 


I 






n 




/ 





AAGAG CC AAGTC Jl TG AA t C CTTTC A j ATTTC AC CTC TTC TTS TCTT ATTTTTT C C AT £ ATC 

1201 * * * ----- + 12€0 

TKTCOGTTC ACTACTTaCC AAAGTCTAAflTCGAG AACAAC AG AATAAAAAAGGTATT AG 



T 

4 2 
Bp c 

taO fi* 5 

EqB at •? 

113 ri r 

/ / 

rTGrrrTATVTTACCAATCCTCGAAT^ 

1261 ♦ — — -4 * * 1320 

AAC AAAATAAAATCGTT ACGA/GCTTAACTTTGAAAC ATC ATGTCAAAACTTT'TT ATTG AA 



r P KM 
C BflnGai 

» C BAEEvA*uuct*pNTCT 



BNSDOC1D <WO 9748793A1TI_> 



■ WO 



20/29 
FIGURE 14 / SHEET 6 



i J iiiiriiimiiin 

/ tUttft! / 

CACTCCWAAAAAAAAAAAAA^CCTAAOTTACTCCC^ 
1321 f + * — * i34 D 

c tc a gg aatt i p rri ■ r rrTTrrr-G gattn aatg aggctjg acc cgccggfgacc aaaata ta 

c 

T T 

« B S 

5 K S u D 

9 « 9 A n 

I r 2 II 

13 91 - — — * + 1^29 

AA C AACA TTTTT AA 'TTTT-JTr AAlX? G AA C TW C7AG AC A T7 * 1"! ' 'J T * I T 1 ' 



Encymes chAt cuts 







Aiwl 


Alv*2Gl 


AlwMT 


Ape? I 


Aval 


Bb£3 


2fcv2 


Bed 






Sfal 


Bf il 




Bpl2 








Be I 


Baa J I 






S*g3 




Ball 




BftttiSI 


BarnFl 


SMI 


BsrDl 




8at« 


cacai 




Cj*P3 


cvur 


CV1RI 






DrdUI 




Bag! 


carl 


EC&S7I 


EcoKI I 






Gdill 


Ka*m 




Hgi3I2 


Hln4l 


Kinf r 


Hphl 




Mboir 


tour 


Mb© I 


MaU 


Hap J 


Hap Air 




Hwol 


Wei r 




NlaTV 


Not I 


UspV 


P2«I 


Pvu2I 


Real 


Rear 




ScrPJ 


sfaivi 


Srna I 


Sept 


Styl 


Tall 


Taql 






Tiil 


TtlA I 


V* + Z 




Tap4CI 


Tap509Z 




TthJILII 


Xmnl 










that do not cut: 












Aat 12 




AeelSI 


Mill 


Af 1221 


Ahd2 




ApaBI 




A*£3 


Avail 


AvtII 






Ban 2 


San 1 1 






Bell 


Bgir 


Bmgfl 


BpulDT 


Bp-diiO^l 


B*aj 






BacQl 


&5 1HKAI 


Bip24I 


B*p3 41 


Bapi2&6l 








BapWI 


Bsrar 




BBflKII 


BflflSI 


BfiStlL073 


BatEII 




fiau36I 




P*al 


Drari2 


Drd2 


&*a2 




£cd4722I 


EroNI 


EcoOlOSI 




EcoRY 


F*ul 


f-eal 




Bad 


Ha*T2 


Hhal 


Kindt 


KindMI 


Kpal 


Kpnl 


Miur 




Mac I 


Natl 


Weal 


NT4aI 


MgpAZV 


Nh«I 


MrUl 




N*p2 


Pad 


M111MI 






Pwe I 






PsAAI 




Patl 


Pvul 


^LeAl 


Rflr II 


Sad I 


Sail 


SanEl 


Sap I 




s«r 




&Cc2 






5ffcAI 


Sirni 


SnaBI 


5pel 


Sphl 


SrCI 


see$3a?r 




SBtl 


SCUI 


Sun I 


s war 




Taqir 






VepI 






X*fcd 



BNSDOCID: <:WO 9748793A 1TI__> 



V 



# 



-WO 97ftS7W PCTVUS97/J 



ncona L5 / shsct 1 



1 TNTTGAACGN ltJAACAAAAA HTAHGGTAAG CTCGGNAGGA CAAGANTTCT 

SI TGTNACCACA ACACAACAAC GCCATCAACC NATCGGTTTC TTNTGTTTNG 

iai AGA-rcAccrr tcttoagttc gtggtttc^ aghtcaagnt ccttcttsac 

l^L TCACTCiAAGT CCAGATGCAG CNTCAAAAC7 TTTGTCC^GT AGACtfTACCA 

201 AGAGTAACAG CACCAACCAA ATCGCTATCC CATGTAATCA AAACCTTATC 

£S1 ACNTTCATwG TCCTCATATA TAATCTCAGG CCGTTGTTCC AC A.TT G IT AT 

301 TGTCCCTGDC AATTCTTTGC ATCACAATAC CCATCAGCTN TTCGAGCT3T 

?5l TCACCTCCAG AAGTAAACCG ATCTACACGG CCCTTAAOCT CTTCAAATTT 
GAACCAAAAC CAATTCCCTft GTCCTACAGA TGGGTAACAA CTGAGCTTCC 

4^4, CTATATCTGA ATCATGCATC ATTCCCGACA TTTCACTTTC AGTXJTCAGJvA 

SO 1 TCA7CAGCTG GCTCTAACCC AAGAGCTGAA TCCCAAAUTT TCTCCATCAT 

551 CGTCTTTOCC; AT^TCATTTA CAGCTCCAGA ACTCTTCTCC ACCATTGAAA 

adl TACCTGCGTC AGTAATCTGA ACAACGTCTA CACAACCTCC AGCTGATCCA 

6 51 TCTTTATCTA TAATTCGAAS AT3TAGAAAC TTXCCATCAT GCATT^TATC 

7 PI CAATCCATCC AGAATCGTTG TCTCTAGCCA TGCACATTCA CCATTCCCTC 
751 TCATTACCTT CTCGACAAGA OTC^ATTCAG G^GATAAJVTT TTGTGCCACC 
9C-1 ACTCGCATCA GAATCTCCTT TGAAGTCAAG ATTCCACTSA TTTToTTCCC 
3 Si COTSG AA^T GATTACACA& TTAACCCGCA AATCC-T-AT CCTTTTCGCA 
9D1 GCAACTCAAA CACCATCTCA TCCTCCTAZA AGTGCAACCT TCCATCTCT3 
*&1 TAATAATCGT GACAACGCG3 GTTAAACAfT HTNTCCTTCA AGGTTTCAAT 

1001 GAAAGETATAC C3TGCAGAAT t ATCCCCTTCI CCATTSTTTH TUCACACCTT 

105 L CCACTGCACC AGCTAAAC^A CTAC CTTGCT CTGCASTTTC TCCATCCTAG 

1101 ^AATAGCATC ATACAAACAC TTTCTAAT^T CZAAGAAAGC AATCACTT2A 
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2051 GtfTGTTCCTG TTTCCCCGTG TGTGGTGGWT CMCCACCTCC TTTCTTATAA 
TCHACGACTA AAAATGTTAA ANATAANACT iAJJATTTCTT TCTAJ4AAAAA 
TC5TAAAANC CAAATGTTTT TTTVTTTCTG ATAAATGTCT ATW^ATCACC 
CTTT^TTTTT AAATAATCAA ATTTGATCAC ATTTATCTCT TCTATCTAGN 
AGAGTTAATC GCTAACATAA ANACCAAAAA AAATTAATTC NAATAAATAT 
iATTTGTGTG GGTTACATGG AAAAATTdTC AAATAA7AAA WCAAAAAAAA 
ATTGTATAGA TGCAGTGCAA CTTCTTTCTC CTC AACTTC T CGTCGAC^CT 
CACAACTGTT TGTrAC-AAG-r GCACTCGCAT 5TAA7TCCCT CTTTTAATAA 
CTTA3CA3TT ACACCA7CCA ACATCT5AT7 7GACAGAAAA A7ATTTTACT 
GAAATGTGAT CSSTCCACAT TTTT-TATCT ACGTTrAAGC CTTTAAGGTA 
^ACGnTAAT CCNAAAATAT, CCCTSAAT^A CAACA~CGAT TAATCSAAC? 
AACTA£ATAC CTCCTCCC5TT TC3ATCSCTC AAATGCAACC ATGATGCAAG 
2651 tTTTTGCGAT TCACCCAAAG TGAGAGAACT ASATC^ACAT G3ATTATTCG 



2051 
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27 01 CAACCATTAC GSCACCCTTA TATAATCCCA GCATCTTAAT ACTWACAJiA 

275 1 AGCTTTACCC TTAQCTTTTA SCTTCCTTCA CTCTTTCCAT ACATTGTGAA 

2 SOI TCTCwCTTT TAGATCGACC atagtggaaa aaggctttca tcaataactc 

23 SI GTGGACTTCA TCAATGGTAG AAAAGANAAT ACATAGTATG GAAAACTAGA 

2501 TfiTTTGATAT ATTTCGTTCA AACTCTTATC CGCTCITGAG CTGATATAC A 

295 1 CATGAACACA TAACAATCGC AfAGCCGACA AACTAGTATT CATTAACCTT 

3 001 TTTCTCTAAA CAGATTCTCC TATCAATCTA AATTTTAGAT GTTAAAAA^A 
3 051 AAATggCaag ^ctaaacagg ccgetaggtt ^tttcacga tgAigtaaa^ 

3101 agtaqCCatC t ogtCQZttgC OAffACCg-gcc -3 00 CC4 1 C 

3151 -atg^gcrcata ggctccrccc* affttgTCCTT GGCAGAATTT AGTAACTTCG 

3 201 CCTTTTCTTA TCAACAATCA ATaGTTTAAG GCTTTACCTG CAAGAAATOA 

3 251 AGAGTTTAAG C5CTTCTTTTT CGTATTCCCC ATTCAC ACAA GTGAGcTAGC 

3 501 TCATCAGAG? CCACCACCTT CCCACTAAAA AATTCAAAAT TGTTCCTTCT 

> ELL 



3 551 GTCAljcTCAA ATTAAACAAA GCGAGAAAAG CCCATACAAA CCATTTCCAA 

3*01 T3CTTCATCT TCTCCTTTGA AAATCCTTCT TCTl3CTTAAT> GCTCCTAGAT 

3451 ATCGAT^TCG GTGTTCTTCT TCCATCATTG CAATCTCTGA GCTOTCTCTT 

3 501 TAGCTTTTGA CTGSTGCAAT "TGTTATTGTG AAATT3TTCT TCGCTTTTGG 

3 551 ATCAGCTTTT CTTAAATTCG TTCCCAGATT TTAGCTTTAT CTGCTSGTCT 

3 601 ttta^^ttcct ttacttgggn" cgntggcgga gaaattctcc ccgogaaagt 

3 SSI TATTCGCCCC GTCCTTTT^T CACATCCCTC TCAACTTCST TACCQATCCA 

37 01 ATCCTATATT TGATTTGATT TACTCTC:TCT ACW7TCCTC AGAGTCTGTG 

37 51 AGCTCGAAAG TTCATTTCCA TTACTTTCCT TAATTC AATT TCAC^TCTAT 

3601 T3GL7ACTAAT ATTGTTCCTA CCTATTTfrTCC CAATCTCTGC AAAACTTgCC 

3 651 A7TCTAT-AC CTCTTGTAAG TGTAGTTACA AGATT^CG^T TCT^TTTCTA 

^01 TGAATCCGAA TCCTATATCC TATATGAAT- CC-ATTC-CAAT' TSCTTTCTCA 

3? 51 CACTCATTCC ACTGAGATGT TTGGTAGGTG GTTGCGGATA GAGGACTT3A 

40D1 "TTACT^TCA GCTACTfTWNA TT*?£"FTGTC? TTO^GG-AAGA TGATCAATCC 

■iOSi TTASTCCGON CTCTTOGATT TTAGCsTSNGT TACCATCAGA TTNG-TTwS 

-UD1 GTOGTGTCAT TTGTAATCTC , CATCATATCT CTTAATATTC TCACCTCACA 

4151 7CAS-C AT*?ST ATS^ACTGG GCGAAGTTPCC TCGAATAACG GT^CTTCCCT 

-J ^ L, 1 J. jhiiaJl. i_ _ js L .• -_ rtu-bjnji. .- <l L J .-3 1 j-lL rt ^ *La as_ J i_>*j. * m> _ ^ «-»_rti— * mi a A 
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OiSl CAATTGTAGA TCTCXTACTT TCTTCTTACA CAAAATGTTt; TTAAAAGTTA 

4301 TATATTTTSA CTCCTCCAAG ACCAAAACTA AGAAATAATC TCCTACTATA 

4351 TA0ACTTTCA AACACJGAAT TOG*CAAGAT CATTCTATAG AACTTCGTAG 

<H91 AGTC7TGAGT AATTTCTCCT AGAACGGTTC TACCTTCCTC TTTTTTCCTT 

44 Si TTAACCGCAG TCACTTTACC TTTTCGAACT TTTCTACTCA AACTAGAACT 

4501 TCTOCTTTTC TCTTTCACTT ATCTClTCCA AACAACTCCT TCAAtTTTTT 

4551 CTCATATTGT TTCTWCATG TCATAOGTGG TTTGGAATAC AGCTSAATCC 

4601 TCAGTT7ATC ACCATTCATC TCAAGTAATC CATTTTTCTC TTTTTTCTTC 

4£51 TATTDrrCAG CCAAGGCTAC ATCATTCCTT CACTTTCTTC CGTACTCAAT 

4701 CGAGTGGCAG TSTAATAATC TAAtCAGCAG TTATCCATGC TTATCATCAA 

4?5i TOOT ACT TAT TCCTTCTGTA GGTTTTTCTT T3TCAGAGCC GOTATGATCG 

d601 CATCCCTGC? tATCAATCTC TCTATTCTGG CAAAAAtfTCT CCACCATCCT 

«651 TCCTTGAGTC AGTCGATCAT CTTXACCAGA TCTTC7CTCC CCTAAATTTG 

4 £Q1 GTTTTTACTT ACAAATCTTCS CTTCTTCAAN TCTSATCATC TCTGTTTTOT 
4?51 TAGTTTTGA^ TAGTHTATA ATTCCACTTA TATATATTOG ATACTTTGTT 
SflOl CATCAAGAAT ACATOACCTC TACGTAAOTT CATCGCGTCT TAAGGAAACA 
50SJ CAT^rCTCTT ACCAAAAAAT GACCATTTCC ATTATTACAT CTACTTTGA7 
$1D1 7TTACTCTTT TCAGGTCGGA CATAATTCCA GAGAGACTAG GCTTCATGCT 

■3i«.i agtgtttgga gatctcctgt ogattccttt cacttttagc attcaggcat 

52 Dl CTAACTGTCA CCCTGAACAC AAACAAGATA TTAATTTATC TTATTGACAC 

5 2 SI TATCTTCTTG GCATGTTACA CTTATTITTCG GAAACAATAT ?GT*TCTACAA 

5 J 01 T5CTTSATCA CTCTGTCACt CAATTcrtCTT CTCTCTGGT^ CAGCCCTCGT 
51 51 C5CTTTTGCA CAACAAAGTA OAAtfAACAA TTCCTCCGAT TGTACTCAAT 
5461 TGCCTTGTCT TCTTGATACO GTAAG77CTC ACACATCGGG TTATTTTCCA. 
5451 TTCTTACATA TCTACAC7AA GAAACC~ACT ATTTCTTCTT TG5CAGGTAC 
5501 ATCGTTTTTX7 GAGGAGCTAA CAAACAAAAA CA7ATCTTTA AGAAGAACCC 
*5?1 AAAAACACGA ATATGGGCCA ACCCTCCAS? CCTAGTrSGT GGAAACTTTAC 
■501 TGGTTTZAGG CTATTSGTA? GTTATATTTA TTTTCTCTTG TrTCTrtGCT 
5d51 TGGTTTC 5£ Z ATCTCTSTCT r TTGATT5TTC ATCATS£?5G GAATAAAGAC 
5701 TT^AAACrrC CGCAATGACA CATtTZCGAT AACTTACC7C CTCTTTTGTA 
5"i TATATCJ AC AC GSGAATTCCA ACGCACTGTA ATTACCTTCG CGACTTGATC 
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I I I I III! Ill I I : I I i I I I ' I I M I I I I I I I ? I I I I I I I I I I I I M I 
3 5 06 TTCGTTTACTTGCX5NCGNTwCCOMAAATTCTCCC CG0GAAAGTTAT7C 3t5S 

. • i • • 
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lD3ft - , . ACTTCTCCOCTTCCATATTTCTACCCGATaT- 1069 

i f m 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 

€004 TTTC C TCTtCTG T A T T CT£ ACTTC TC C OQTT C C AT ATTTC TAX: C C C AT AT 6053 

i J r - • 

IDb? ACCTTCTGATATrATTGATMO^^ 2 11B 
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6054 ACCTOTTOAT\ACTArS15ATATM&GAGA^ 

1119 GCACAGAAGT^AAtf^ATttTGraCAGAGT^ UfiB 
I | II M iUii I f I I II Ml I I I M II I lUU N f C J II I I Nil I I I I 
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1169 G AG AATACTTCCTTATOTTTTATTAG ATOTGrc 1213 
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G AGAATA.'^TTCCTTA TC TTTATTAG ATGTGC CAAG AA TTC A TG AAT 6ZQ3 

■ ■ ■ • b 
121* CCTTTCAGATTCACCTCTTGTTCTCTTATTTTTTCCA'rAA „ , , , 

M I I I I I I I It ( ( r I I I I I II LM I II I I III 
€204 CCTTT C AG ATTC A"T<! C TCTTGTGTCTTATTTTTTC A'FTAAATGTG AC?«?TG £25* 

12S5 , r T^TTX^TTTATTTT A-^CAATG C Tt G AATTOAAACT7TX2TAG 1259 
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■ ■ 
TACACrTTTC-AAAAATAACTl^AOTCCTT 1*33 
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